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OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



October 27, 2003, 10:47:27 ; Search time 1415.77 Seconds 

(without alignments) 
10489.161 Million cell updates/sec 

US-10-017-372E-14 
363 

1 gccctggataccaactactg gttcctgcaagtgcagctga 363 



Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2888711 seqs, 20454813386 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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Database 



GenEmbl : * 

1 : gb_ba : * 

2 : gbjitg : * 

3 : gb__in : * 
4 : gb_om : * 
5 : gb_ov : * 
6: gb_pat:* 
7 : gb_ph : * 
8: gbjpl:* 

9 : gb_pr : * 
1 0 .- gb_ro : * 
11: gb_StS:* 

12: gb_sy:* 
13: gb_un:* 
14: gb_vi:* 
15: em_ba:* 
16: em_fun:* 
1 7 : em_hum : * 
18: em_in:* 
1 9 : em_mu : * 
2 0 : em_om : * 
21: em_or:* 
22: em_ov:* 
23: em_pat:* 
24: em_ph:* 
25: em_pl:* 
26: em_ro:* 
27: em StS:* 



2 8 : em_un : * 
29: em_vi:* 

3 0 : em_htg_hum : * 
3 1 : em_htg_inv : * 

3 2 : em_htg_o ther : * 

33: em_htg_mus : * 

34: em_htg_pln:* 

3 5 : em_htg_rod : * 

36: em_htg_mam:* 

37: em_htg_vrt:* 

38: em_sy:* 

3 9 : em_htgo_hum : * 

4 0 em_htgo_mus : * 

4 1 : em_htgo_other : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AX338213 

LOCUS AX338213 1326 bp DNA linear PAT 09-JAN-2002 

DEFINITION Sequence 1 from Patent WO0181404. 
ACCESSION AX338213 

VERSION AX338213.1 GI: 18128750 

KEYWORDS 

SOURCE Sus scrofa (pig) 

ORGANISM Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Cetart iodactyla / Suina; Suidae; Sus. 
REFERENCE 1 

AUTHORS Strober,W., Nakamura,K., Kitani,A. and Fuss, I. J. 

TITLE Inducible plasmid vector encoding tgf-_g(b) and uses thereof 

JOURNAL Patent: WO 0181404-A 1 01-NOV-2001; 

THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US) 
FEATURES Location/Qualifiers 
source 1. .1326 

/organism="Sus scrofa" 
/mol_type= "genomic DNA" 
/db_xref ="taxon: 9823" 
CDS 16. .1188 

/ no te= "unnamed protein product" 
/codon_start=l 
/protein_id="CAD20538 .1" 
/db_xref ="GI : 18128751" 

/ trans 1 a t ion= " MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
R I EA I RGQ I L S KLRLAS P P S QG D V P PG P L P EA VLAL YNS TR DR VAG E S VE P E P E P EAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYS KVLAL YNQHN PGASAAPCCVPQALE PL P I VYYVGRKP KVEQLSNM I VRS CKCS " 

BASE COUNT 263 a 438 c 392 g 233 t 

ORIGIN 



Query Match 84.5%; 
Best Local Similarity 99.4%; 
Matches 308; Conservative 



Score 3 06.8; DB 6; 
Pred. No. 5.2e-52; 
0; Mismatches 2; 



Length 1326; 

Indels 0; Gaps 0; 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



54 



879 



114 



CAAGGAGAAGAACTG CTG CGTG CGG CAG CTCTACATTGA CTTCCGGAAGGAC CTGGG CTG 

II I ! Ml 1 1 1 1 1 II I ' 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 ,! 1 1 1 1 1: II 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 ! I 

CACGGAGAAGAACTG CTG CGTG CGG CAG CTCTACATTGACTTCCGGAAGGAC CTGGG CTG 



GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

M 1 1 1 1 1 II 1 1 i 1 1 ; 1 1 1 II I i 1 1 1 1 1 1 '1 1 1 1 1 1 ' 1 1 II 1 1 i 1 1 1 1 1 1 1 M'l I II 1 1 

93 9 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 



113 



938 



173 



998 



174 a\TCTGGAGCCTAGAC^CT(^GTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 23 3 

II II M I M I II II II I M II M II I 11 1 II II 1 1 M II It M 1 1 M I M I II I M M I 

999 CATCTGGAGCCTAGACACTC^GTACAGC^ 1058 
234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

I M I M M M I M III M 1 1 1 1 1 1 1 1 II II 1 1 1 1 II M II 1 1 1 1 1! MM 1 1 1 1 1 1 1 1 M 

1059 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1118 
2 94 CTACGTGGGCCG(^GCC(^GGTGGAG(^GCTGTCC^CATGATCGTGCGTTCCTGC^A 353 

M II 1 1 1 MM MM I II 1 1 1 M M 1 1 1 1 1 M 1 1 1 1 II II 1 1 1 1 III M 1 1 1 M 1 1 II I 

1119 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 1178 

354 GTGCAGCTGA 3 63 

MIMIMII 
1179 GTGCAGCTGA 1188 



RESULT 2 

SSTGFBR 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 
FEATURES 

source 



CDS 



SSTGFBR 1605 bp mRNA linear MAM 27-MAR-1995 

Porcine mRNA for transforming growth factor-beta (TFG) precursor. 
Y00111 

Y00111.1 GI:2129 
transforming growth factor-beta. 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
1 (bases 1 to 1605) 
Derynck,R. and Rhee,L. 

Sequence of the porcine transforming growth factor-beta precursor 

Nucleic Acids Res. 15 (7), 3187 (1987) 

87174844 

3470708 

Location/Qualifiers 
1. .1605 

/organism="Sus scrofa" 
/ mo 1 _ t ype = " mRNA " 
/db_xref = " taxon : 9823 " 
/tissue_type=" ovary" 
/clone_lib=" lambda gtlO" 
404. .1576 

/note="TFG-beta precursor (AA 1-390) 11 

/codon_start=l 

/protein_id="CAA68291 .1" 

/db__xref = "GI :213 0" 

/db_xref = " SWI SS - PROT : P0 72 0 0 " 

/translat ion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 



mat_peptide 



misc feature 



misc feature 



BASE COUNT 
ORIGIN 



298 a 



YYAKEVTRVLMLESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVE INGFNSGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
1238. .1573 

/product = ,f mature TFG-beta" 
647. .655 

/note="pot. N-glycosylation site" 
809. .817 

/note="pot. N-glycosylation site" 
572 c 437 g 298 t 



Query Match 84.5%; 
Best Local Similarity 99.4%; 
Matches 308; Conservative 



Score 3 06 . 8 ; DB 4; 
Pred. No. 5.1e-52; 
0; Mismatches 2; 



Length 1605; 



Indels 



0; Gaps 



0; 



Qy 



Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 1 1 1 1 1 1 1 1 1 i 1 1 : II 1 1 ' 1 1; 1 1 II , 1 1 M 1 M M 1 1 1 1 1 1 1 1 '1 1 1 1 1 1 1 II I 

12 67 CACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 1326 



Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I II II M M 1 1 1 M I M MM I Ml M I II II 1 1 1 II II III I II I I II I M 1 1 1 II I 

Db 1327 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 1386 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



174 CATCTGGAGCCTAGACACTCAGTACAAC^GGTCCTGGCTCTGTACAACCAGCACAACCC 2 33 

■I 1 1 1 1 1 n i m 1 1 1 'i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 ii i ii 1 1 ii 1 1 1 'i 1 1 1 1 1 1 

1387 (^TCTGGAGCCTAGAC^CTCAGTAC^GC^GGTCCTGGCTCTGTACAACCAGCAC^ACCC 144 6 



234 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

ii i < 1 1 ii i ii ii I ii 1 1 1 ii; 1 1 iii m ii i in M ii i ii iii ii ii i ii mi ii I ii 

144 7 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 15 06 



294 



353 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

M MM II Mill ML'II M, llllll I llllll IIIIIIIMIII lllll ! lll MM 

1507 CTACGTGGGCCGC^GCCCAAGGTGGAG(^GCTGTCC^(^TGATCGTGCGTTCCTGa^ 1566 
3 54 GTGCAGCTGA 363 

MIIIIMM 

1567 GTGCAGCTGA 1576 



RESULT 3 

GGTGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



GGTGFB1 1750 bp mRNA linear MAM 27-MAR-1996 

Porcine mRNA for transforming growth factor-beta 1. 

X12373 

X12373.1 GI: 63808 

transforming growth factor-beta 1. 

Sus scrofa (pig) 

Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetart iodactyla ; Suma; Suidae; 
1 (bases 1 to 1750) 

Jakowlew,S.B. , Dillard, P . J . , Sporn,M.B. and Roberts, A. B. 
Nucleotide sequence of chicken transforming growth factor-beta 1 



Euteleostomi ; 
Sus. 



(TGF-beta 1) 

JOURNAL Nucleic Acids Res. 16 (17), 8730 (1988) 
MEDLINE 88335639 
PUBMED 3166520 
REFERENCE 2 (bases 1 to 1750) 
AUTHORS Jakowl ew , S . B . 
TITLE Direct Submission 

JOURNAL Submitted (14- JUL-1988) Jakowl ew S . B . , National Institute of 

health, National Cancer Institute, Laboratory of Chemopr event ion, 
Building 41, Room B902, Bethesda, Maryland 2 0892, USA 
COMMENT The submitters believe that the chicken cDNA library was 

contaminated with porcine cDNA, and that the sequence is infact 
porcine TGF-beta-1. 27-MAR-1996. 
FEATURES Locat lon/Qual i f iers 

source 1. .1750 

/organism="Sus scrota" 
/mol_type="mRNA M 
/strain="white leghorn" 
/db_xref="t axon: 9823" 
/clone="pTGFB-ChX119" 
/eel l_type= " chondrocyte " 
gene 1. .1750 

/gene="TGF-beta 1" 
5'UTR 1. .446 

/gene= n TGF-beta 1" 
CDS 447. .1622 

/gene=" TGF-beta 1" 
/codon_start=l 

/product=" transforming growth factor" 
/protein_id="CAA30933 . 1" 
/db_xref ="GI : 638 0 9" 
/db_xref = " SWI SS - PROT : P0 72 0 0 " 

/ 1 rans 1 a t ion= 11 MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINAGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSR 
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 
TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
mat_peptide 1285. .1619 



3 'UTR 

BASE COUNT 
ORIGIN 



325 a 



/gene= "TGF-beta 1" 
1620. .1750 
/gene="TGF-beta 1" 
627 c 479 g 



319 t 



Query Match 84.5%; Score 306.8; DB 4; Length 1750; 

Best Local Similarity 99.4%; Pred. No. 5.1e-52; 

Matches 308; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAG AACTGCTG CGTGCGG GAG CTCTA CATTGA CTT C CGGAAGGAC CTGGG CTG 113 

M I I I I ' I I I I I I I ' I 

Db 1313 CACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 1372 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I I I I , I , ' I , | | I | 

Db 1373 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 1432 



Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCT 233 

'I II 1 1 1 1 1 1 1 II 1 1 1 1 !i 1 1 ! 1 1 1 1 II I Ml III Mill. IIIMIIIII I II Ml II 

Db 14 33 C^TCTGGAGCCTAGACACT(^GTACAGC^GGTCCTGGCTCTGTACAACCAGCACAACCC 14 92 

Qy 234 GGGCGCGTCGGGGGCGCCGTGCTGGGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 14 93 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGGTGGAGCCACTGCCCATCGTGTA 15 52 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

illlll MMMM MM III IMI III M M Ml I Mill I IMIMMI I II I III 

Db 1553 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 1612 

Qy 354 GTGCAGCTGA 363 

IIIMIIIII 

Db 1613 GTGCAGCTGA 1622 



RESULT 4 
AF461808 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF461808 2221 bp mRNA linear MAM 03-JAN-2002 

Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete 
cds . 

AF461808 

AF461808.1 GI: 18042250 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus . 

1 (bases 1 to 2221) 

Wimmers,K., Chomdej , S . , Ponsuksili, S. and Schellander, K. 
Polymorphism in the porcine transforming growth factor beta 1 gene 
Unpubl ished 

2 (bases 1 to 2221) 

Wimmers,K., Chomde j , S . , Ponsuksili , S . and Schellander, K. 
Direct Submission 

Submitted (20-DEC-2001) Institute of Animal Breeding Science, 
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany 

Location/Qual if iers 

1. .2221 

/ organism= " Sus scrofa " 

/mol_type= "mRNA" 

/db_xref = n taxon: 9823" 

/chromosome= " 6 11 

1. .2221 

/ gene= M TGFB1 " 

1. .1173 

/gene="TGFBl" 

/not e = 11 cyt ok ine " 

/codon_start=l 

/product^" transforming growth factor beta 1" 
/protein_id="AAL57902 .1" 
/db_xref ="GI : 18042251" 

/ 1 rans 1 a t i on= " MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 



3 'UTR 

BASE COUNT 
ORIGIN 



KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDIATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
1174. .2221 
/gene="TGFBl" 
504 a 616 c 649 g 452 t 



Query Match 84.5%; Score 306.8; DB 4; 

Best Local Similarity 99.4%; Pred . No. 4.9e-52; 
Matches 308; Conservative 0; Mismatches 2; 



Length 2221; 
Indels 0; 



Gaps 



0; 



Qy 



Db 



54 CAAGGAG AAG AA CTG CTG CG TG CGG CAG CT C TA CATTG A CTT C CG GAAGGAC CTGGG CTG 113 

II 1 1 1 i ' 1 1 1 II I II 1 1, 1; II 1 1 1 1 1 1 1 1 1 1 1 , Ml I Ml 1 1 1 1 1 1 1 1 1 1 , 1 1 1 III 

8 64 CACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 923 



Qy 

Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I IM II MM M IMIMM M I IMMM III MM IMMMMIMM M III Ml 

924 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 983 
174 C^TCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCT 233 

M I M 1 1 MMM M M 1 1 M M M II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 1 1 1 1 

984 CATCTGGAGCCTAGACACTC^GTAC^GC^GGTCCTGGCTCTGTAC^CC^GCACAACCC 104 3 



Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

1 1 1 M 1 1 1 IMM 1 1 1 1 1 1 1 1 1 M IM I II 1 1 1 II II 1 1 1 II M I Ml I MM M I MMM 

Db 1044 GGGCG CGT CGG CGG CG CCGTG CTG CGTG CCG CAGG CG CTGGAG CCACTGC C CAT CGTGTA 1103 



Qy 2 94 CTACGTGGGCCGC^GCCC^GGTGGAGOVGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

M 1 1 1 1 1 1 M 1 1 II Ml I II 1 1 M IM 1 1 1 1 1 1 1 1 II 1 1 1 II M M I! 1 1 M I! I Mill 

Db 1104 CTACGTGGGCCGC^GCC(^GGTGGAG(^GCTGTCC^ACATGATCGTGCGTTCCTGCAA 1163 

Qy 3 54 GTGCAGCTGA 363 

llllllllll 
Db 1164 GTGCAGCTGA 1173 



RESULT 5 

PIGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 



MAM 31-MAR-1995 
complete cds . 



PIGTGFB1A 32 06 bp mRNA linear 

Sus scrofa transforming growth factor beta-1 mRNA 
M23703 

M23703 .1 GI : 755044 
transforming growth factor-beta- 1 . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; 
1 (bases 1 to 3206) 

Kondaiah,P., Van Obberghen-Schil 1 ing , E . , Ludwig,R.L 
Sporn,M.B. and Roberts, A. B. 

cDNA cloning of porcine transforming growth factor-beta 1 mRNAs 

Evidence for alternate splicing and polyadenylat ion 

J. Biol. Chem. 263 (34), 18313-18317 (1988) 

89054010 

2461367 



Euteleostomi ; 
Sus . 

, Dhar,R., 



COMMENT 



FEATURES 

source 



gene 



CDS 



polyA_signal 
polyA_site 



On Apr 1, 1995 this sequence version replaced gi : 341017. 

Original source text: Sus scrofa (strain miniature swine) cDNA to 

mRNA . 

Location/Qualifiers 
1. .3206 

/organi sm= " Sus s crof a " 
/mol_type="mRNA" 
/strain="miniature swine" 
/db_xref ="taxon: 9823" 

/eel l__type= "peripheral blood lymphocyte" 
1. .3206 

/gene= "TGF-beta - 1 11 
906. .2078 
/gene= "TGF-beta- 1" 
/ codon__start = l 

/product = " transforming growth factor-beta - 1 " 
/protein_id="AAA64616. 1" 
/db_xref="Gl : 755045" 

/ trans lation= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVTjMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KL.KVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYS KVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS " 
3186. .3191 



BASE COUNT 
ORIGIN 



/gene="TGF-beta-l" 
3206 

/gene= "TGF-beta- 1" 
645 a 1041 c 924 g 



596 t 



Query Match 84.5%; Score 3 06.8; DB 4; Length 32 06; 

Best Local Similarity 99.4%; Pred. No. 4.7e-52; 

Matches 308; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I II I II M 1 1 i 1 1 1 M 1 1 1 1 M 1 1 1 1 II I M I Ml I Ml M 1 1 1 II II 1 1 II M 1 1 1 

Db 176 9 CACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 182 8 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I M III MM III I II I II 1 1 1 1 1 1 1 MM I Ml I III MM 1 1 1 Ml II I M Ml 1 1 1 

Db 182 9 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 1888 



Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAG 233 

i II 'II I III II ' 1 1 ' III Ml 1 1 1 1 II II I III II M II 1 1 1 MM II I Ml II 1 1 1 1 

Db 188 9 CATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTAC^ 194 8 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

1 1 II II I II M 1 1 1 1 1 1 M I II I M I II Ml II 1 1 M I M II 1 1 1 1 1 1 Ml I II III 1 1 1 

Db 194 9 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 008 

Qy 2 94 CTA CGTGGG C CG CAAG C C CAAGGTGG AG CAG CTGT C CAA CATG AT CGTG CGTT C CTG CAA 353 

1 1 1 1 II I II 1 1 II 1 1 1 1 M I II I M I II Ml II II M 1 1 1 Ml 1 1 1 1 1 Ml I II MM 1 1 

Db 2 00 9 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 2068 



Qy 



3 54 GTGCAGCTGA 3 63 



Db 



■ 1 1 1 I 1 1 ; I 

2 069 GTGCAGCTGA 2 078 



RESULT 6 
AR036686 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FEATURES 

source 

BASE COUNT 
ORIGIN 



AR036686 339 bp DNA linear PAT 29-SEP-1999 

Sequence 29 from patent US 5800811. 

AR036686 

AR036686.1 GI:5954542 

Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 339) 

Hall,F.L., Nimni,M.E., Tuan, T. -L. , Wu,L. and Cheung, D.T. 
Artificial skin prepared from coclagen matrix containing 
transforming growth factor- . beta . having a collagen binding site 
Patent: US 5800811-A 29 01-SEP-1998; 

Locat ion/Qual i f iers 

1. .339 

/ organism^ " unknown " 
66 a 113 c 100 g 60 t 



Query Match 77.5%; 
Best Local Similarity 94.2%; 
Matches 292; Conservative 



Score 281.2; DB 6; 
Pred. No. 8.9e-47; 
0; Mismatches 18; 



Length 33 9; 
Indels 0; 



Gaps 



0; 



Qy 

Db 



54 



30 



CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 M 1 1 1 M: 1 1 1 1: 1'l I II 1 1 1 1 ! Ill III III III II Nihil Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM Nil; Mill 

90 GAAGTGGATCCATGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTC 233 

III MINIM INN INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

150 CATTTGGAGCCTGGAC^CGC^GTAC^GC^AGGTCCTGGCCCTGTACAACCAGCATAACCC 2 09 

234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

IN I I I I I I I I I I I I I [ I I I I I I I I I I I I [ I I I I I I I I I I I I I N I I I I I I I ! II 
210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 269 

2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC^TGATCGTGCGTTCCTGC^A 353 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMI 

27 0 CTACGTGGGCCGC^GCCC^AGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

3 54 GTGCAGCTGA 363 
330 GTGCAGCTGA 33 9 



RESULT 7 
AX455100 
LOCUS 



AX455100 



489 bp DNA linear PAT 06-JUL-2002 



DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



BASE COUNT 
ORIGIN 



Sequence 167 from Patent WO0208453. 
AX455100 

AX455100. 1 GI : 21714285 

Canis familiaris (dog) 
Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

1 

Farr,S.B., Pickett , G . G . , Neft,R.E. and Dunn.R.T. 
Canine toxicity genes 

Patent: WO 0208453-A 167 31-JAN-2002; 
Phase- 1 Molecular Toxicology (US) 

Location/Qualifiers 

1. .489 

/organism=" Canis famil iaris " 
/mol_type= "genomic DNA" 
/db_xref="taxon: 9615" 
84 a 183 c 145 g 76 t 1 others 



Query Match 77.5%; Score 281.2; DB 6; 

Best Local Similarity 94.2%; Pred. No. 8.5e-47; 
Matches 292; Conservative 0; Mismatches 18; 



Length 4 89; 
Indels 0; 



Gaps 



0; 



Qy 
Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

ii 1 1 1 1 1 ii 1 1 1 ii i ! 1 1 1 1 iiiiiiiiiiiiiiiiiiimii iiiii iiiiiiii 

101 CACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTG 16 0 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

IIMIMII IIIII IIIIIIII MMI II II 1 1 Ml M 1 1 1 1 1 M M I Mill 

161 GAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTA 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTC 

III IIIIIIII IIIII MM 1111,1111 1 1 1 MM I II 1 1 1 Ml Ml I Ml 

221 CATTTGGAGCCTGGACACGC^GTACAGCAAGGTCCTGGCCCTGTAOWVCCAGCACAACCC 



234 



281 



294 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

I M I M I II I M 1 1 1 1 1 1 III 1 1 1 Ml 1 1 M 1 1 1 M 1 1 1 MM I II 1 1 1 MMM I III 

GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

1 1 1 1 M I Ml 1 1 1 1 1 M M M 1 1 1 1 Ml I II M I M 1 1 1 II 1 1 1 1 1 1 1 1 1 MMM 

341 CTACGTGGGCCGC^GCCCAAGGTGGAGC^GCTGTCGAACATGATCGTGCGCTCCTGCAA 

354 GTGCAGCTGA 363 

II II I II M I 
4 01 GTGCAGCTGA 410 



173 



220 



233 



280 



293 



340 



353 



400 



RESULT 8 

0ATGFB1 

LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 



1173 bp mRNA linear 
for transforming growth factor-beta I 



OATGFB1 
O.aries mRNA 
X76916 

X76916.1 GI :496648 

TGF-beta 1; transforming growth factor-beta 1 



MAM 18-APR-1995 



SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Cetartiodactyla ; Ruminant ia; Pecora; Bovoidea; 
Ovis . 



the gene encoding ovine 



of Edinburgh, Dept. of 



mat_pept ide 



BASE COUNT 
ORIGIN 



Ovis aries (sheep) 
Ovis aries 
Eukaryota ; Metazoa 
Mammalia; Eutheria 
Bovidae; Caprinae; 
1 

Woodall , C. J. , McLaren, L.J. andWatt,N.J. 
Sequence and chromosomal localisation of 
latent transforming growth factor-beta 1 
Gene 150 (2), 371-373 (1994) 
95121932 
7821809 

2 (bases 1 to 1173) 
Woodall ,C. 
Direct Submission 

Submitted (24 -DEC-1993 ) C. Woodall, Univ 
Veterinary Pathology, Sc. of Vet. Studies, Univ. of Edinburgh, 
Edinburgh EH9 IQH, UK 

Locat ion/Qua 1 i f iers 
1. .1173 

/organism="Ovis aries" 
/mol_type="mRNA" 
/db_xref ="taxon: 994 0" 
1. .1173 
/codon_start=l 

/product = " trans forming growth factor-beta 1" 
/protein_id="CAA54242 . 1" 
/db_xref="GI : 496649" 
/ db_x r e f = "SWISS-PROT:P50414" 

/ trans lation="MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRK 
GI EAI RGQI LSKLRLASPPSQGDVPPGPLPEAI LALYNSTRDRVAGESAETEPEPEAD 
YYAKEVTRVLMVEYGNKI YDKMKSSSHS I YMFFNTSELREAVPEPVLLSRADVRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR 
LSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
835. .1170 

/product="transforming growth factor-beta 1" 
245 a 378 c 336 g 214 t 



Query Match 77. 5%; 

Best Local Similarity 94.2%; 
Matches 292 / Conservative 



Score 281.2; DB 4; 
Pred. No. 7.7e-47; 
0; Mismatches 18; 



Length 1173; 
Indels 0; 



Gaps 



Qy 

Db 



0; 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

1 1 II M MM 1 1 II MMMMMMM! MMMMMMMMMI 

8 64 CACAGAAAAGAACTGCTGTGTTCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 923 



Qy 

Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

1 1 1 M 1 1 1 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M MMM MM M M M M M Ml 

924 GAAGTGGATTCACGAACCCAAGGGCTACCACGCCAATTTCTGCCTGGGGCCCTGTCCCTA 



173 



983 



174 CATCTGGAGCCTAGAC^CTCAGTACAACAAGGTCCTGGCT 233 

1111:1111111 Mill 1 1 1 1 1 1 1 IMMIIMM t M M 1 1 1 1 1 1 1 1 1 i I ! 1 1 1 

984 CATCTGGAGCCTGGACACACAGTACAGCAAGGTCCTGGCCCT^ 104 3 



Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

Mini i iiiiiMiiii ii 1 i i i 

Db 104 4 GGGCGCATCGGCGGCGCCGTGCTGCGTGCCTCAGGCGCTGGAACCCCTGCCCATCGTGTA 1103 

Qy 2 94 CTACGTGGGCCGCAAGCCC^AGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

MMMl MMIM: MIM , I Ml I I 1 I I 

Db 1104 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAA 1163 

Qy 354 GTGCAGCTGA 363 

MUM Ml 
Db 1164 GTGCAGCTGA 1173 



RESULT 9 

DOGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 

FEATURES 

source 



gene 
5 'UTR 
CDS 



Euteleostomi ; 
Canis . 



DOGTGFB1A 1369 bp mRNA linear MAM 30-OCT-1994 

Canine transforming growth factor-beta 1 (TGFB1) mRNA, complete 
cds . 
L34956 

L34956. 1 GI :516071 

homologue; transforming growth factor-beta 1. 
Canis familiaris (dog) 
Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; 
1 (bases 1 to 1369) 

Manning, A.M. , Auchampach, J. A. , Drong,R.F. and SI ightom, J . L. 
Cloning of a canine cDNA homologous to human transforming growth 
factor-beta 1 (TGFbetal) 
Unpublished (1994) 

Original source text: Canis familiaris adult jugular vein 
endothelial cDNA to mRNA. 

Locat ion/Qual i f i ers 

1. .1369 

/organism=" Canis f amil iaris" 

/mol_type= "mRNA" 

/db_xref ="taxon: 9615" 

/eel l_type=" LPS -activated" 

/tissue_type=" jugular vein endothelial" 

/ dev_stage= " adul t " 

1. .1369 

/gene="TGFBl" 

1. .57 

/ gene= " TGFB1 " 
58. .1230 
/gene="TGFBl" 

/ f unct ion= 11 ant i - inf lammatory agent " 

/not e= "precursor" 

/codon_start=l 

/product=" transforming growth factor-beta 1" 
/protein__id="AAA51458 .1" 
/db_xref="GI : 516072" 

/ translat ion= "MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRK 
RI EAI RGQI LSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVENTNKI YEKVKKSPHSI YMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFR 
LSAHCSCDSKDNTLQVDI NGFSSSRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRQ 



RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
mat_peptide 103 9. .1227 

/gene= M TGFBl M 

/product= M transforming growth factor-beta 1" 
3'UTR 1231. .1369 

/gene="TGFBl" 

BASE COUNT 264 a 473 c 415 g 216 t 1 others 

ORIGIN 



Query Match 77.5%; Score 281.2; DB 4; Length 1369; 

Best Local Similarity 94.2%; Pred. No. 7.5e-47; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 



Qy 
Db 


54 
921 


CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 

II Mlllllllllllllll i 1 III MM llli 1 1 Mill 1 1 1 1 1 1 1 1 

CACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTG 


113 
980 


Qy 


114 


GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

MINIMI Mill IMMMI Mill II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 f 1 

GAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTA 


173 


Db 


981 


1040 


Qy 


174 


CA.TCTGGAGCCTAGACACTCyVGT^ 

III IMMMI MIM MMMI MIMMIMM 1 1 1 1 II 1 II II II 1 1 1 1 II 1 

CATTTGGAGCCTGGACACGCAGTA(^G(^GGTCCTGGCCCTGTAa^CCAGCACAACCC 


233 


Db 


1041 


1100 


Qy 


234 


GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 
GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 


293 


Db 


1101 


1160 


Qy 
Db 


294 
1161 


CTACGTGGGCCGCAAGCCa^GGTGGAGC^GCTGTCOUVC^TGATCGTGCGTTCCTGC^A 

1 1 1 1 1 1 1 1 1 1 II M M II 1 II 1 II 1 1 1 1 1 1 II 1 1 II M 1 1 1 1 1 1 1 1 M 1 1 IMMMI 

CTACGTGGGCCGC^GCCC^GGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCTGCAA 


353 
1220 


Qy 


354 


GTGCAGCTGA 3 63 




Db 


1221 


hi MIM 
GTGCAGCTGA 1230 





RESULT 10 

AGMTGFB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 



AGMTGFB 1561 bp mRNA linear PRI 27-APR-1993 

Simian transforming growth factor-beta (TGF) mRNA, complete cds . 
M16658 

M16658.1 GI: 176552 

growth factor; transforming growth factor-beta. 
Cercopithecus aethiops (African green monkey) 
Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 
Cercopithecinae; Cercopithecus. 
1 (bases 1 to 1561) 

Sharpies, K., Plowman, G. D. , Rose,T.M., Twardzik, D . R . and 
Purchio,A. F. 

Cloning and sequence analysis of simian transforming growth 

factor-beta cDNA 

DNA 6 (3) , 239-244 (1987) 



MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



sig_peptide 
mat_peptide 



BASE COUNT 
ORIGIN 



87246074 
3474130 

Original source text: African green monkey cells (cell line 
BSC-40) , cDNA to mRNA, clone pTGF-beta-2. 

Location/Qualifiers 

1. .1561 

/organism="Cercopithecus aethiops" 
/mol__type=" mRNA" 
/db_xref ="taxon: 9534 " 
262. .1434 

/note="transf orming growth factor-beta precursor" 
/ codon_s t a rt = 1 
/protein_id="AAA35369 . 1" 
/db_xref="GI : 176553" 

/ trans lation="MPPSGLRLLPLLLPLLWLLVTiTPSRPAAGLSTCKTIDMELVKRK 
RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
262. .1095 

/note="transforming growth factor-beta signal peptide" 
1096. .1431 

/product=" transforming growth factor-beta" 
301 a 547 c 446 g 267 t 



Query Match 77. 5%; 

Best Local Similarity 94.2%; 
Matches 292; Conservative 



Score 2 81.2; DB 9; 
Pred. No. 7.4e-47; 
0; Mismatches 18; 



Length 1561; 
Indels 0; 



Gaps 



0; 



Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 1: 1 1 1 1 1 1 II 1 1 M I Ml 1 1 1 I M II IIIIIIMIII ''III Mill 

Db 112 5 CA CGGAGAAG AA CTG CTG CG TG CGG GAG CTG TATATTGA C TTC CG CAAGG A CCTCGGCTG 118 4 



Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Mill M II I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I 
118 5 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTA 124 4 

174 CATCTGGAGCCTAGACACTCAGTACAAC^ 233 

III IMMMI Mill I I I I I I I I f I I I I I I I I i I I I 1 I I I f I I I I I I I Mill 
124 5 (^TTTGGAGCCTGGACACG(^GTA(^GCmGGTCCTGGCCCTGTACAACC^GCATAACCC 13 04 

234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

Ml I I I I I I I I I I ] I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I 
13 05 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 13 64 



294 



353 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

j 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 IMMMI 

13 65 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1424 



Qy 

Db 



3 54 GTGCAGCTGA 363 

IMMMI 

142 5 ATGCAGCTGA 14 34 



RESULT 11 

A18277 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



A18277 339 bp mRNA linear PAT 17 -MAY- 1994 

H. sapiens TGF-beta 1 gene seq ID No:l. 

A18277 

A18277. 1 GI : 513237 

synthetic construct 
synthetic construct 
artificial sequences . 
1 (bases 1 to 339) 

Cerletti,N., McMaster , G . K. , Cox,D., Schmitz,A. and Meyhack,B. 
Process for the production of biologically active protein (e.g. 
TGF) 

Patent: EP 0433225-A 1 19-JUN-1991; 
CIBA-GEIGY AG 

Location/Qualifiers 

1. .339 

/organism= 11 synthetic construct " 
/mol_type="mRNA" 
/db_xr e f = " t axon : 3 2 6 3 0 " 
1. .339 

/note= M Protein sequence is in conflict with the conceptual 

translation" 

/codon_start=l 

/ trans l_table= 11 

/product = "TGF-beta 1" 

/protein_id="CAA01385 . 1" 

/db_xref="GI : 4 52 99 03" 

/ 1 rans 1 a t ion= " ALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFC 
LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLS 
NMIVRSCKCS" 
66 a 114 c 100 g 59 t 



Query Match 77.0%; 
Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 6; 
Pred. No. 1.9e-46; 
0; Mismatches 19; 



Length 33 9; 
Indels 0; 



Gaps 



0; 



Qy 

Db 



54 



30 



CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 

II M 1 1 1 1 1 1 ! M 1 1 : 1 1 [ 1 1 i M 1 1 1 1 1 1 1 r 1 1 i 1 1 r I j MINIM Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 



113 



89 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 



114 GAAGTGGATTCATGAACCCAAGGGCTACCA.TGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

III II II I Ml I MM MM Mill 1 1 MINIM MINIM INN 

90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 
174 CATCTGGAGCCTAGACACTC^GTACAACAAGGTCCTG 23 3 

Ml MINIM Mill I i 1 1 1 1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 Mill 

150 CATTTGGAGCCTGGA(^CGCAGTA(^GCAAGGTCCTGGCCCTGTACAAC(^GaVTAACCC 2 0 9 
234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MUM I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 269 
2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCyVGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

M I Ml II I M 1 1 MM M M M M Ml M I M I M 1 1 M III Ml Ml i I MINIM 



Db 



270 CTACGTGGGCCGCAAGCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGCTCCTGC^ 32 9 



Qy 



Db 



3 54 GTGCAGCTGA 363 

1,111.1 

33 0 GTGCAGCTGA 33 9 



RESULT 12 

A23751 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



BASE COUNT 
ORIGIN 



A23751 339 bp mRNA linear PAT 25-JAN-1995 

TGF-betal coding region. 

A23751 

A23751.1 GI:825585 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 {bases 1 to 339) 

McMaster , G . K. , Cox,D., Cerletti,N. and Kuhla,J. 
Novel hybrid transforming growth factors 
Patent: EP 0542679-A 1 19-MAY-1993; 
CIBA-GEIGY AG 

Loca t ion/ Qua 1 i f i ers 

1. .339 

/organism="Homo sapiens" 
/mo l_type= " mRNA " 
/db_xref ="taxon: 9606" 
66 a 114 c 100 g 59 t 



Query Match 77.0%; Score 279.6; DB 6; Length 339; 

Best Local Similarity 93.9%; Pred. No. 1.9e-46; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

I! II Ml I MMM MM 1 1 1 1 1 1 1 M I M M MM MMM Mill 

3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MMM II II MMMMIMMMMMI 11:111 1 1 MMM Mill 

90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTC 233 

Ml IMMIII MMI MMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

150 C^TTTGGAGCCTGGA(^CG(^GTA(^G(^GGTCCTGGCCCTGTACAACCAGC^TAACC^ 2 09 

234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MMM I I I I I I I I I I I I I I ! I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
210 GGG CG CCTCGG CGG CG CCGTG CTG CGTG C CG CAGG CG CTGGAG C CG CTG C C CATCGTGTA 269 

294 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 

27 0 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 



354 



GTGCAGCTGA 

Ml MM 



363 



Db 



330 GTGCAGCTGA 33 9 



RESULT 13 

A48549 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

COMMENT 
FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



A48549 339 bp DNA linear PAT 07-MAR-1997 

Sequence 1 from Patent WO9603432. 

A48549 

A48549.1 GI:2302319 

unidentified 

unidentified 

unclassified. 

1 (bases 1 to 339) 

Cerletti,N. 

NOVEL PROCESS FOR THE PRODUCTION OF BIOLOGICALLY ACTIVE DIMERIC 
PROTEI 

Patent: WO 9603432-A 1 08-FEB-1996; 
CIBA GEIGY AG (CH) 

Other publication AU 3109595 960222. 
Location/Qualifiers 
1. .339 

/organism= "unidentified" 
/mol_type="genomic DNA" 
/db_xref="taxon: 32644" 
1. .336 

/note="Protein sequence is in conflict with the conceptual 

translation" 

/codon_start=l 

/product =" HUMAN TGF-BETA1" 

/protein_id="CAA03113 .1" 

/db_xref="GI : 230232 0" 

/ translat ion="ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFC 
LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLS 
NMIVRSCKCS " 
66 a 114 c 100 g 59 t 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 6; 
Pred. No. 1.9e-4 6; 
0; Mismatches 19; 



Length 33 9; 



Indels 



0; Gaps 



0; 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 !l 1 1 1 M 1 1 1 1 1 1 1 1 i I i 1 1 < I M 1 1 1 1 1 1 1 1 1 1 II 1 1 llllllll Mill 

3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 89 



Qy 

Db 

Qy 

Db 

Qy 

Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMI Mill 

90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 
174 CATCTGGAGCCTAGAC^CTC^GTACAACAAGGTCCTGGCTCT^ 233 

III MINIM MMI IMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

15 0 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 209 



234 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

MMM MM Ml 1 1 1 1 1 1 ! 1 1 1 U 1 1, M 1 1 1 1 1 1 1 , 'I I I M 1 1 1 1 1 1 1 1 M I 

210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 6 9 



Qy 

Db 

Qy 
Db 



2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

III III III III III ll lllllll! I II llh 1 1 Mlllll! 1 1 M II 1 1 Ml. MM 

2 70 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 329 

3 54 GTGCAGCTGA 3 63 

lllllll I 
33 0 GTGCAGCTGA 33 9 



RESULT 14 

A48563 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 
FEATURES 

source 



CDS 



A48563 
Sequence 
A48563 
A48563 . 1 



339 bp 
1 from Patent WO9603433. 

GI :2302333 



DNA 



linear PAT 07-MAR-1997 



BASE COUNT 
ORIGIN 



unidentified 

unidentified 

unclassified . 

1 (bases 1 to 339) 

Cerletti,N. 

NEW PROCESS FOR THE PRODUCTION OF BIOLOGICALLY ACTIVE PROTEIN 
Patent: WO 9603433-A 1 08-FEB-1996; 
CIBA GEIGY AG (CH) 

Other publication AU 3109695 960222. 
Location/Qual if lers 
1. .339 

/organism= "unidentified" 
/mol_type= "genomic DNA" 
/db_xr e f = " t axon : 3 2 6 4 4 " 
1. .336 

/note=" Protein sequence is in conflict with the conceptual 

translation" 

/codon_start=l 

/product =" HUMAN TGF-BETA1 " 

/protein_id= M CAA03120. 1" 

/db_xref="GI : 2302334" 

/ translation "ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFC 
LGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLS 
NMIVRSCKCS " 
66 a 114 c 100 g 59 t 



Query Match 77.0%; Score 279.6; DB 6; Length 339; 

Best Local Similarity 93.9%; Pred. No. 1.9e-46; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; 



Gaps 



0; 



Qy 
Db 

Qy 
Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 'Mi Ml Ml III II lllllllll 'I I I i I I II I I I II I li 'lllllll Mill 
3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

lllllllll II II 1 1 II 1 1 1 i 1 1 1 1 1 IT 1 1 1 I [1,111 1 MINIM Mill 

90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 



Qy 174 CATCTGGAGCCTAGACACTC^GTACAACAAGGTC^ 233 



Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



150 



234 



III IN Mill 11:1' MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

CATTTGGAGCCTGGAC^CGCAGTAC^Ga^GGTCCTGGCCCTGTACAACCAGCATAACCC 



294 



209 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 



353 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 I II M M I 

27 0 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 
354 GTGCAGCTGA 363 

MMMMII 

33 0 GTGCAGCTGA 33 9 



RESULT 15 

156854 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FEATURES 

source 

BASE COUNT 
ORIGIN 



156854 
Sequence 
156854 
156854. 1 



33 9 bp 

1 from patent US 5650494. 
GI :2477267 



DNA 



linear 



PAT 07-OCT-1997 



Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 339) 

Cerletti,N., McMaster, G.Kent . , Cox,D., Schmitz,A. and Meyhack,B. 
Process for refolding recombinant ly produced TGF- . beta . -like 
proteins 

Patent: US 5650494-A 1 22-JUL-1997; 
Location/Qualifiers 



66 



1. .339 

/organ i sm= " unknown 11 
a 114 c 100 g 



59 t 



Query Match 77.0%; Score 279.6; DB 6; 

Best Local Similarity 93.9%; Pred. No. 1.9e-46; 
Matches 291; Conservative 0; Mismatches 19; 



Length 33 9; 
Indels 0; 



Gaps 



Qy 


54 


Db 


30 


Qy 


114 


Db 


90 


Qy 


174 


Db 


150 


Qy 


234 


Db 


210 



CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I ! I MINIM Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 



Mill' II II II II II M 



II 1 1 1 1 1 1 1 Mill 



Mill 



113 



89 



CATCTGGAGCCTAGACACTC^GTACAACAAGG 

Ml MIMI Mill I II 1 1 1 1 MINIMUM 1 1 1 1 1 II 1 1 1 1 1 1 1 Mill 

CATTTGGAGCCTGGACACG(^GTA(^GC^AGGTCCTGGCCCTGTACAACCAGCATAACCC 



233 



209 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MIMI I ! 1 1 1 1 i I II 1 1 : 1 1 ! 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 ! 1 1 1 1 1 NMMMNNM 

GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 6 9 



Qy 294 CTACGTGGGCCGCAA.GCCCAAGGTGGAGOVGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

ill I i I II I ; I M 1 1 III II I i 1 1 1 1 1 II 1 1 1 1 1; I i II I III I M 1 1 ! 1 1 llllllll 

Db 270 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

Qy 354 GTGCAGCTGA 363 

Db 33 0 GTGCAGCTGA 33 9 



Search completed: October 27, 2003, 18:34:52 
Job time : 1416.77 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 
Title: 

Perfect score: 
Sequence : 

Scoring table: 
Searched: 



October 27, 2003, 10:47:27 ; Search time 109.057 Seconds 

(without alignments) 
8985.201 Million cell updates/sec 

US-10-017-372E-14 
363 

1 gccctggataccaactactg gttcctgcaagtgcagctga 363 

I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



2552756 seqs, 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



5105512 



Post -processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



N__Geneseq_19Jun03 : * 



1 

2 
3 
4 

5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 



/SIDSl/gcgdata/geneseq/geneseqn- 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/S I DS 1 /gcgda t a /geneseq/genes eqn - 
/SIDSl/gcgdata/geneseq/geneseqn- 
/SIDSl/gcgdata/geneseq/geneseqn 
/ S I DS 1 /gcgda t a /gene s eq/gene s eqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/ SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 
/SIDSl/gcgdata/geneseq/geneseqn 



embl/NA1980. 
embl/NA1981 . 
embl/NA1982 . 
embl/NA1983 . 
embl/NA1984 . 
embl/NA1985 . 
embl/NA1986 . 
embl/NA198 7 . 
embl/NA1988 . 
-embl/NA1989 
-embl/NA1990 
-embl/NA1991 
-embl/NA1992 
embl/NA19 93 
-embl/NA1994 
-embl/NA1995 



DAT: * 
DAT: * 
DAT:* 
DAT: * 
DAT: * 
DAT: * 
DAT: * 
DAT: * 
DAT: * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 
. DAT : * 



17 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NAl 996 . DAT: * 

18 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1997 . DAT : * 

19 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1998 .DAT: * 

20 : /SlDSl/gcgdata/geneseq/geneseqn-embl/NA1999 . DAT: * 

21: /SIDS1 /gcgda ta /geneseq/genes eqn - embl /NA2 00 0. DAT : * 

22 : / SIDSl/gcgdata/ geneseq/genes eqn -embl/NA2 001A. DAT : * 

23 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001B. DAT: * 

24 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2002 . DAT: * 

25 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2003 .DAT: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Descript ion 


1 


306 


8 


84 


5 


1326 


24 


AAD22696 


Porcine trans f ormi 


2 


281 


2 


77 


5 


339 


18 


AAT42771 


TGF-betal active f 


3 


281 


2 


77 


5 


339 


20 


AAV99375 


cDNA encoding a tr 


4 


281 


2 


77 


5 


489 


24 


ABL99528 


Target canine gene 


5 


281 


2 


77 


5 


1303 


11 


AAO093 17 


Mnnkpv trsn^fnrmi n 

i i i y l» ^ %>i i i u i- -i_ mill 




281 


2 


77 


5 


1559 


13 


AAQ2 028 9 


^emipncp pnrndina 


7 


281 


2 


77 


5 


1561 


11 


AAQ03268 


^itni An t~ ran^ f nrnni n 

X 1 1 1 X GlA A I* A. CA X .1 kj A- Vw/ A- 1 1 1 _L i A 


8 


281 


2 


77 


5 


1571 


1 1 


AAO03269 


Human t~ ran^ f nrmi i nn 


9 


279 


6 


77 


0 


339 


17 


AAT17235 


Human TfiF-bet~a 1 n 


10 


279 


6 


77 


0 


339 


20 


AAX15245 


pDMA pnrndina t~hp 


11 


279 


6 


77 


0 


650 


24 


ABK84023 


Human pDNA dif f prp 


12 


279 


6 


77 


0 


650 


24 


ARTifift ft 1 ft 




13 


279 


6 


77 


0 


1176 


24 


ABZ35738 


Huma n TG F beta 1 p 


14 


279 


6 


77 


0 


1176 


24 


ABX09981 


Human TGFbetal DNA 


15 


279 


6 


77 


0 


1176 


24 


ABV78162 


Human TGF beta 1 D 


16 


279 


6 


77 


0 


1176 


24 


ABL91703 


Human polynucleoti 


17 


279 


6 


77 


0 


1176 


25 


ABV753 91 


TGFB1 Arg25Pro pol 


18 


279 


6 


77 


0 


1176 


25 


ABV753 92 


TGFB1 Arg2 5Pro pol 


19 


279 


6 


77 


0 


1560 


9 


AAN81084 


Coding sequence of 


20 


279 


6 


77 


0 


1560 


11 


AAQ03508 


Simian Transformin 


21 


279 


6 


77 


0 


1569 


9 


AAN81085 


Coding sequence of 


22 


279 


6 


77 


0 


1821 


12 


AAQ13392 


Human pro-TGF-beta 


23 


279 


6 


77 


0 


2537 


7 


AAN60972 


Sequence encoding 


24 


279 


6 


77 


0 


2537 


11 


AAQ03301 


cDNA encoding huma 


25 


279 


6 


77 


0 


2537 


11 


AAQ02814 


Sequence of pre-TG 


26 


279 


6 


77 


0 


2537 


15 


AAQ56923 


Human pre-TGF-beta 


27 


279 


6 


77 


0 


2537 


17 


AAT15720 


Pre- trans forming g 


28 


279 


6 


77 


0 


2537 


19 


AAV52933 


Human pre-transfor 


29 


279 


6 


77 


0 


2742 


22 


AAI58342 


Human polynucleoti 


30 


279 


6 


77 


0 


2745 


16 


AAT05876 


cDNA encoding tran 


31 


279 


6 


77 


0 


2745 


22 


AAH28216 


Nucleotide sequenc 


32 


279 


6 


77 


0 


3541 


17 


AAT16516 


Collagen Al/TGF-be 


33 


279 


6 


77 


0 


3541 


21 


AAA12498 


cDNA encoding a ch 


34 


279 


6 


77 


0 


4105 


15 


AAQ55624 


TGFbetal 5 ' -UTR-CD 


35 


278 


76 


6 


339 


12 


AAQ11993 


Transforming Growt 


36 


278 


76 


6 


339 


17 


AAT15462 


Human transforming 


37 


278 


76 


6 


1266 


24 


ABK12871 


Chimeric TGF-B-SA 


38 


278 


76 


6 


1569 


11 


AAQ03509 


Human Transforming 



39 


278 


76 


. 6 


2527 


25 


ABQ76674 


Androgen receptor 


40 


275.4 


75 


. 9 


345 


14 


AAQ41599 


Mature human Trans 


41 


266.8 


73 


. 5 


336 


14 


AAQ41606 


Transforming Growt 


42 


262 .2 


72. 


.2 


1561 


11 


AAQ04908 


Sequence encoding 


43 


262 .2 


72 . 


.2 


1561 


13 


AAQ29177 


TGF-beta 1/beta 2 


44 


253 .2 


69, 


.8 


875 


23 


AAS70979 


DNA encoding novel 


45 


248 


68. 


.3 


336 


14 


AAQ41604 


Transforming Growt 



ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 Standard; cDNA ; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) cDNA. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; ss . 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 
FT CDS 16.. 1188 

FT /*tag= a 

FT /product= "Porcine TGF-betal mutant protein" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2001WO-US12980 . 
XX 

PR 20-APR-2000; 2000US- 199014P . 
XX 

PA (USSH ) US DEPT HEALTH 8c HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 
DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 
PT disease in humans, comprises vector containing transforming growth 
PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 
CC gene encoding a regulatory transcription factor under the control of a 



CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is a cDNA encoding porcine TGF-betal mutant. 

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 84.5%; Score 306.8; DB 24; Length 1326; 

Best Local Similarity 99.4%; Pred . No. 2.8e-64; 

Matches 3 08; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

ii ; m 1 1 1 1 1 m 1 1 , 1 1 . i ; 1 1 1 ii 1 1 1 : i , : : ' i 1 1 1 1 1 1 . 1 1 1 1 1 n 1 1 1 1 1 

Db 87 9 CACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 938 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 1 1 1 1 1 1! II I II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I li I M I ! I 

Db 93 9 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 998 

Qy 174 C^TCTGGAGCCTAGACACTCAGTAC^CAAGGTCCTGGCTCTGTAC^CCAGCACAACCC 233 

'I I I I : I ' I i r < I I I I M I I I I I I I I I I I I 

Db 99 9 CATCTGGAGCCTAGACACTCAGTACAGCAAGGT^ 1058 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

I I I I ' I I 

Db 1059 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1118 

Qy 2 94 CTACGTGGGCCGCAAGCCC^GGTGGAGC^GCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

M 1 1 :i 1 1 1 1 1 II I '1 1 1 1 1 1 1 1 1 1 1 II I Ml 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M ,1 II 1 1 1 1 1 M 

Db 1119 CTACGTGGGCCG(^GCC(^GGTGGAGCAGCTGTC(^CATGATCGTGCGTTCCTGCAA 1178 

Qy 354 GTGCAGCTGA 363 

MIIMIIII 

Db 1179 GTGCAGCTGA 118 8 



RESULT 2 
AAT42771 

ID AAT42771 standard; cDNA; 339 BP. 
XX 

AC AAT42771; 
XX 

DT 26-AUG-1997 (first entry) 
XX 

DE TGF-betal active fragment of a TGF-beta fusion protein encoding cDNA. 



XX 

KW Transforming growth factor-beta fusion protein; wound healing; 

KW artificial skin; surgery recovery time; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1 . .339 

FT /*tag= a 

FT /function= TGF active fragment 

XX 

PN WO9639430-A1. 
XX 

PD 12-DEC-1996. 
XX 

PF 05-JUN-1996; 96WO-US08 973 . 
XX 

PR 06-JUN-1995; 95US- 04 70837 . 
XX 

PA (CHEU/) CHEUNG D T. 

PA (HALL/) HALL F L. 

pa (mm/) NIMNI M E. 

PA (TUAN/) TUAN T. 

PA (WULL/) WU L. 
XX 

PI Cheung DT, Hall FL, Nimni ME, Tuan T, Wu L; 
XX 

DR WPI; 1997-043065/04. 

DR P-PSDB; AAW08173. 
XX 

PT Prepn. of transforming growth factor-beta fusion protein - useful to 

PT reduce surgery recovery time and to prepare artificial skin 

XX 

PS Disclosure; Page 44-45; 59pp; English. 
XX 

CC A novel transforming growth factor-beta (TGF-beta) fusion protein 

CC comprises a purification tag and a TGF active fragment. The present 

CC sequence encodes a specifically claimed TGF active fragment, TGF-betal. 

CC Additionally, the fusion protein may comprise proteinase -sens it ive 

CC linker sites and binding domain so the protein sequence may contain 

CC some or all of the following elements: purification tag : proteinase 

CC site:ECM binding site : proteinase site : TGF-beta . TGF-beta promotes 

CC wound healing, and the fusion protein can be used to reduce surgery 

CC recovery time and in the preparation of artificial skin. The inclusion 

CC of a purification tag facilitates purification of the fusion protein. 

CC The proteinase site is included to permit cleavage and release of the 

CC purification tag after purification if desired. The extracellular 

CC matrix binding site facilitates delivery of the fusion protein to the 

CC desired site of action. Delivery of the TGF-beta to the site to be 

CC treated reduces the amount of TGF-beta required to be administered to 

CC be effective and reduces the concentration of circulating TGF-beta 

CC which may result in undesirable effects. 
XX 

SQ Sequence 339 BP ; 66 A; 113 C; 100 G; 60 T; 0 other; 



Query Match 77.5%; Score 281.2; DB 18; Length 339; 

Best Local Similarity 94.2%; Pred. No. 3e-58; 



Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 



Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

M 1 1 ! 1 1 , M II 1 1 1 1 1 1 i I ' 1 1 ! 1 1 M ' llllllllllllll llllllll Mill 

Db 30 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Illllllll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllll llllllll Mill 

Db 90 GAAGTGGATCCATGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

Ml llllllll Mill MINI IMIIIIIIII [ r I f 1 1 1 1 1 1 1 1 1 ] Mill 

Db 150 (^TTTGGAGCCTGGAC^CGCAGTAC^GC^AGGTCCTGGCCCTGTACAACCAGCATAACCC 209 

Qy 234 GGG CGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 ' I 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

Db 210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I llllllll 
Db 2 70 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

Qy 354 GTGCAGCTGA 363 

1 1 1 1 1 ii : i 

Db 3 30 GTGCAGCTGA 33 9 



RESULT 3 
AAV99375 

ID AAV99375 standard; cDNA; 339 BP. 
XX 

AC AAV99375; 
XX 

DT 25-MAR-1999 (first entry) 
XX 

DE cDNA encoding a transforming growth factor beta active fragment. 
XX 

KW Proteinase site; bone morphogenet ic fusion protein; bone binding site; 

KW bone morphogenet ic protein; transforming growth factor beta; 

KW active fragment; wound healing; bone growth; purification tag; ds . 

XX 

OS Homo sapiens. 
XX 

PN W09855137-A1 . 
XX 

PD 10-DEC-1998. 
XX 

PF 02-JUN-1998; 98WO-US1 1 1 8 9 . 
XX 

PR 03-JUN-1997; 97US- 08 68452 . 
XX 

PA (HALL/) HALL F L. 

PA (HANB/) HAN B. 

PA (NIMN/) NIMNI M E. 

PA (SHOR/) SHORS E C. 

PA (WULL/) WU L . 

XX 



PI Hall FL, Han B, Nimni ME, Shors EC, Wu L; 
XX 

DR WPI; 1999-059875/05. 

DR P-PSDB; AAW84207. 
XX 

PT New bone morphogenet ic fusion proteins - comprising a purification 

PT tag and a bone morphogenetic active fragment, used for enhancing 

PT wound healing or bone growth 
XX 

PS Example 1; Page 41; 64pp; English. 
XX 

CC The present sequence encodes a transforming growth factor beta active 

CC fragment. The protein can be used in place of a bone morphogenetic 

CC active fragment to create the fusion proteins of the invention. When a 

CC bone morphogenetic active fragment is used, the fusion proteins are 

CC designated bone morphogenetic fusion proteins. The bone morphogenetic 

CC fusion protein may contain some or all of the following elements: a 

CC purification tag, a proteinase site, an ECM/bone binding site, a second 

CC proteinase site, and a bone morphogenetic protein active fragment. 

CC The bone morphogenetic fusion proteins can be used for enhancing wound 

CC healing or bone growth. 

XX 

SQ Sequence 339 BP; 66 A; 113 C; 100 G; 60 T; 0 other; 

Query Match 77.5%; Score 281.2; DB 20; Length 33 9; 

Best Local Similarity 94.2%; Pred. No. 3e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 : 1 1 II ! 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 MINI MINI I MINI Mill 

Db 3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI Mill I ! 1 1 1 r 1 1 1 1 1 1 r 1 1 1 1 1 ! F "II llllllll Mill 

Db 90 GAAGTGGATCCATGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 149 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAAC^^ 233 

III llllllll Mill 1 1 1 1 1 : 1 MINIMI Mil llllllll 1 Mill 

Db 150 CATTTGGAGCCTGGAC^CGC^GTAC^GCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 209 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MMM 1 1 II 1 1 1 1 1 1 IM M MM 1 1 1 1 M II I Ml 1 1 1 1 II IMMIIMIMII 

Db 210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 

Qy 2 94 CTACGTGGGCCGC^GCCC^GGTGGAG(^GCTGTCCAAC^TGATCGTGCGTTCCTGC^A 3 53 

M I III M I M II M M I MMM 1 1 1 M M II M 1 1 1 1 M I III II II M llllllll 

Db 2 70 CTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCC^CATGATCGTGCGCTCCTGCAA 329 

Qy 354 GTGCAGCTGA 3 63 

IMIIIMM 

Db 33 0 GTGCAGCTGA 339 



RESULT 4 
ABL99528 

ID ABL99528 standard; cDNA; 489 BP. 
XX 



AC ABL99528; 
XX 

DT 02-JUL-2002 (first entry) 
XX 

DE Target canine gene TGFB1 . 
XX 

KW Canine gene array; toxicological response; ss. 
XX 

OS Can is sp . 
XX 

PN WO200208453-A2 . 
XX 

PD 31-JAN-2002. 
XX 

PF 23-JUL-2001; 2 0 0 1WO-US233 1 1 . 
XX 

PR 21-JUL-2000; 2000US-220057P . 
XX 

PA (PHAS-) PHASE- 1 MOLECULAR TOXICOLOGY. 
XX 

PI Farr SB, Pickett GG, Neft RE , Dunn RT; 
XX 

DR WPI; 2002-217063/27. 
XX 

PT Identifying toxicologically relevant canine gene to determine 

PT toxicological responses of agents, by obtaining and comparing gene 

PT expression profiles of untreated canine cells and canine cells treated 

PT with an agent 

XX 

PS Disclosure; Page 67; 140pp; English. 
XX 

CC This invention relates to identifying a toxicologically relevant canine 

CC gene and the generation of an array of toxicologically relevant 

CC canine genes. The gene array is useful for obtaining a gene expression 

CC profile, by exposing a population of cells to an agent, obtaining cDNA 

CC from the population of cells, labeling the cDNA, and contacting the cDNA 

CC with the gene array. The relevant gene is useful for making and using 

CC arrays to determine toxicological responses to various agents, and also 

CC useful for identifying novel gene sequences and novel canine genes. 

CC The method for analysing toxicological responses using the canine 

CC gene array is rapid and efficient. The present sequence is related 

CC to the canine gene array. 

XX 

SQ Sequence 489 BP; 84 A; 183 C; 145 G; 76 T; 1 Other; 

Query Match 77.5%; Score 281.2; DB 24; Length 489; 

Best Local Similarity 94.2%; Pred . No. 3.2e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II NINNIINNINN MIIIIIIIIIIIIIIIIIIMI Mill MINIM 

Db 101 CACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTG 160 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI INN IIININ INN II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 INN 

Db 161 GAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTA 22 0 



Qy 174 CATCTGGAGCCTAGACACTCAGTAGAACAAGGTCCTGGCTC^ 233 

Ml llllllll Mill lllllll llllllllllll llllllllllllllllllll 
Db 221 C^TTTGGAGCCTGGACACGCAGTACAGC^GGTCCTGGCCCTGTACAACCAGCACAACCC 28 0 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

I I I I ; I I I M I I I I I I I I I I I I I I' I II I II I I h I I I I : I I I I ' II I I I I I I I I M I | I 
Db 281 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 34 0 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

I I II II I II ' I I I I : I I I I I I I I II I I I I I I I I I llllllllllllll llllllll 
Db 341 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCTGCAA 4 00 

Qy 354 GTGCAGCTGA 363 

Db 4 01 GTGCAGCTGA 410 



RESULT 5 




AAQ09317 




ID 


AAQ09317 standard; cDNA; 1303 BP. 


XX 






AC 


AAQ09317; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


12-AUG-1990 


(first entry) 


XX 






DE 


Monkey transforming growth factor-beta cDNA. 


XX 






KW 


Transforming growth factor-beta; simian; psoriasis; 


KW 


TGF-beta . 




XX 






OS 


Monkey . 




XX 






FH 


Key 


Location/Qualifiers 


FT 


sigjpept ide 


22. .63 


FT 




/*tag= a 


FT 


mat_pept ide 


836. . 1170 


FT 




/*tag= b 


FT 




/product =monkey transforming growth factor-beta 


XX 






PN 


EP353772-A. 




XX 






PD 


07-FEB-1990. 




XX 






PF 


04 -AUG- 198 9 ; 


89EP-0114458 . 


XX 






PR 


05-AUG-1988; 


88US-0229133 . 


XX 






PA 


(ONCO ) ONCOGEN LP. 


XX 






PI 


Twardzik DR, 


Purchio AF, Ranchalis JE, Stevens V; 


XX 




DR 


WPI; 1990-038499/06. 


DR 


P-PSDB; AAR03 74 3. 


XX 






PT 


Inhibition of 


proliferation of epidermal cells - 


PT 


used to treat 


psoriasis by contacting cells with compositions 



PT containing transforming growth factor-beta. 
XX 

PS Disclosure; fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis-categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03268 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1303 BP; 263 A; 442 C; 378 G; 217 T; 3 other; 

Query Match 77.5%; Score 281.2; DB 11; Length 1303; 

Best Local Similarity 94.2%; Pred. No. 3.8e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllllllllllllllll II lllllllllll MINIM Mill 

Db 8 67 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTG 92 6 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI II II lllllllllll MINIM IIMMINMIIIMMIN I 

Db 927 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTA 98 6 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTC 233 

III llllllll MM 1 1 1 1 N I llllllllllll MINIMUM Mill 

Db 98 7 CATTTGGAGCCTGGACACGCAGTAC^GC^GGTCCTGGCCCTGTACAACCAGCATAACCC 104 6 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

llllll I II 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml II IN 1 1 II 1 1 1 1 II 1 1 M III 

Db 104 7 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1106 

Qy 2 94 CTACGTGGGCCGC^GCC(^GGTGGAGCAGCTGTCC^(^TGATCGTGCGTTCCTGC^ 3 53 

: 1 1 ! 1 1 M 1 1 1 . 1 1 1 1 1 M I i 1 1 1 1 1 1 1 1 1 1 1 , 1 1 i 1 1 ' 1 1 1 1 1 1 1 1 ! 1 1 llllllll 

Db 1107 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1166 

Qy 3 54 GTGCAGCTGA 363 

Illllllll 

Db 1167 ATGCAGCTGA 1176 

RESULT 6 
AAQ20289 

ID AAQ20289 Standard; cDNA; 1559 BP. 
XX 

AC AAQ2 028 9; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 

DE Sequence encoding simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator; 

KW ss. 

XX 

OS Monkey . 
XX 



FH 


Key 


Loca t ion/ Qual i f iers 


FT 


CDS 


262 . .282 


FT 




/*tag= a 


FT 


sig peptide 


283 . . 324 


FT 




/*tag= b 


FT 


CDS 


325 . . 1098 


FT 




/*tag= c 


FT 


mat_peptide 


1099 . . 1436 


FT 




/*tag= d 


XX 






PN 


W09119513-A. 




XX 






PD 


26-DEC-1991 . 




XX 






PF 


20-JUN-1991; 


91WO-US04449. 


XX 






PR 


20-JUN-1990; 


90US-0541221. 


XX 






PA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


XX 






PI 


Oleson FB, 


Comereski CR; 


XX 






DR 


WPI; 1992-024199/03. 


DR 


P-PSDB; AAR20124. 


XX 






PT 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists 


- for modulating blood pressure, for treating 


PT 


hypertension and hypotension 


XX 






PS 


Disclosure; 


Fig 1; 42pp; English. 


XX 






cc 


A new method 


for treating hypertension comprises administering a 


cc 


transforming growth factor (TGF) -beta to an individual at a dose 


cc 


effective for lowering blood pressure; the TGF-beta may be e.g. 


cc 


mature TGF-beta, TGF-beta2 / a mature TGF-betal/beta2 hybrid, TGF- 


cc 


betal precursor, a latent TGF-beta2 precursor, hybrid TGF-beta 1 /TGF - 


cc 


beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 


cc 


complex. 




cc 


(Updated on 


25-MAR-2003 to correct PA field.) 


XX 






SQ 


Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 other; 



Query Match 77.5%; Score 281.2; DB 13; Length 1559; 

Best Local Similarity 94.2%; Pred. No. 4e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 



Qy 


54 


CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 


113 






II 1 I II MINIMI MINIM Mill 




Db 


1124 


CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTG 


1183 


Qy 


114 


GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 


173 






lllllllll II II MM MMI MINI M II 1 M 1 M 1 II 1 1 II II M 1 1 1 1 1 1 




Db 


1184 


GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTA 


1243 


Qy 


174 


CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTC 


233 






Ml MIMIII Mill M MM MINIMUM 1 M 1 1 1 1 1 1 1 1 ! 1 1 Mill 




Db 


1244 


CATTTGGAGCCTGGAC^CGC^GTAC^GC^GGTCCTGGCCCTGTACAACCAGCATAACCC 


1303 



Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MINI 1 1 ! 1 1 1 1 1 1 1 II ' 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 I ill ! 1 1 M ! 1 1 1 1 1 1 1 1! Ill 

Db 13 04 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1363 

Qy 294 CTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 13 64 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1423 

Qy 354 GTGCAGCTGA 3 63 

Illllllll 
Db 1424 ATGCAGCTGA 1433 



RESULT 7 
AAQ03268 

ID AAQ03268 standard; DNA; 1561 BP. 
XX 

AC AAQ03268; 
XX 

DT 25-MAR-2003 (updated) 

DT 12-AUG-1990 (first entry) 

XX 

DE Simian transforming growth factor-beta cDNA. 
XX 

KW Transforming growth factor-beta; psoriasis; TGF-beta; ss . 
XX 

OS Monkey . 
XX 

FH Key Location/Qualifiers 
FT sig_peptide 283.. 324 

FT /*tag= a 

FT mat_peptide 1096.. 1431 

FT /*tag= b 

FT /product =human transforming growth factor-beta 

XX 

PN EP353772-A. 
XX 

PD 07-FEB-1990. 
XX 

PF 04-AUG-1989; 8 9EP- 01 14458 . 
XX 

PR 05-AUG-1988 ; 8 8US - 022 9 133 . 
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Twardzik DR, Purchio AF , Ranchalis JE, Stevens V; 
XX 

DR WPI; 1990-038499/06. 
DR P-PSDB; AAR03743 . 
XX 

PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 20pp; English. 
XX 



CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis -categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03269 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other; 

Query Match 77.5%; Score 281.2; DB 11; Length 1561; 

Best Local Similarity 94.2%; Pred. No. 4e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II Mill II I I I II II ; I II lllllllllll MINIM Mill 

Db 1125 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTG 1184 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1185 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTA 124 4 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAA 2 33 

III MINIM Mill MINN MINIMUM 1 1 1 1 1 1 1 1 1 1 1 1 II INN 

Db 124 5 C^TTTGGAGCCTGGAC^CGC^GTACAGa^GGTCCTGGCCCTGTAC^CCAGCATAACCC 13 04 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

WWW MM M II M M 1 1 M I M I M II 1 1 M M Ml Ml M M M I M Ml I M 1 1 

Db 13 05 GGG CGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 13 64 

Qy 2 94 CTACGTGGGCCGC^GCCC^GGTGGAGC^GCTGTCCAAC^TGATCGTGCGTTCCTGC^A 3 53 

I ,M'M MIIIMM MIMM IMIIIIIIMli MMMM 

Db 13 65 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1424 

Qy 354 GTGCAGCTGA 3 63 

MIIIMM 

Db 142 5 ATGCAGCTGA 1434 



RESULT 8 




AAQ03269 




ID 


AAQ03269 standard; DNA; 1571 BP. 


XX 






AC 


AAQ03269; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


12-AUG-1990 


(first entry) 


XX 






DE 


Human transforming growth factor-beta cDNA . 


XX 






KW 


Transforming growth factor-beta; psoriasis; 


KW 


TGF-beta . 




XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Locat ion/Qual i f iers 


FT 


sig_pept ide 


22. .63 


FT 




/*tag= a 


FT 


mat_peptide 


836. . 1170 



FT /*tag= b 

FT /product =human transforming growth factor-beta 
XX 

PN EP353772-A. 
XX 

PD 07-FEB-1990. 
XX 

PF 04-AUG-1989; 89EP-0114458 . 
XX 

PR 05-AUG-1988; 88US-0229133 . 
XX 

PA (ONCO ) ONCOGEN LP . 
XX 

PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V; 
XX 

DR WPI; 1990-038499/06. 

DR P-PSDB; AAR03750. 
XX 

PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis -categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03268 and AAR03743. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1571 BP; 299 A; 563 C; 443 G; 266 T; 0 Other; 



Query Match 77.5%; Score 281.2; DB 11; Length 1571; 

Best Local Similarity 94.2%; Pred . No. 4e-58; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllllllllllllllll II lllllllllll llllllll Mill 

Db 1135 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTG 1194 

Qy 114 GAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI M II 1 1 1 : 1 1 1 1 1 1 1 1 ! I i 1 1 1 1 1 1 1 M 1 1 1 1 1 ' M 1 1 1 1 1 1 1 1 1 1 

Db 1195 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTA 12 54 

Qy 174 CATCTGGAGCCTAGACACTGAGTACAACAAGGTCCTGGCTCTGTACAA 233 

III llllllll Mill lllllll llllllllllll llllllllllllll Mill 

Db 12 55 CATTTGGAGCCTGGACACGC^GTACAGCAAGGTCCTGGCCCTGTACAACC^GCATAACCC 1314 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MUM I II II I I I I I I II II II I I I I II I I I I I I II II I I I I II I I I I II I II II I I I 
Db 1315 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1374 

Qy 2 94 CTA CGTGGG C CG CAAG C CCAAGGTGGAG CAG CTGTCCAACATGAT CGTG CGTT CCTG CAA 353 

I 1 1 1 ! M 1 1 1 M II 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 , 1 ; 1 1 1 M 1 1 1 1 1 1 1 1 llllllll 

Db 13 75 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1434 



Qy 



3 54 GTGCAGCTGA 363 



Illllllll 

Db 143 5 ATGCAGCTGA 1444 



RESULT 9 
AAT17235 

ID AAT17235 standard; cDNA to mRNA; 339 BP. 
XX 

AC AAT17235; 
XX 

DT 17-JUL-1996 (first entry) 
XX 

DE Human TGF-beta 1 cDNA. 
XX 

KW Transforming growth factor type beta; TGF-beta 1; 

Kw protein renaturat ion; protein folding; ds . 

XX 

OS Homo sapiens. 
XX 

PN WO9603433-A1 . 
XX 

PD 08-FEB-1996. 
XX 

PF 12-JUL-1995; 95WO- EP02 7 1 9 . 
XX 

PR 25-JUL-1994; 94EP-08 1043 9 . 
XX 

PA (CIBA ) CIBA GEIGY AG. 
XX 

PI Cerletti N; 
XX 

DR WPI; 1996-117000/12. 

DR P-PSDB; AAR92773 . 
XX 

PT Prodn. of dimeric biologically active transforming growth factor 

PT by refolding denatured monomer in detergent- free folding buffer 

PT contg. specific organic solvent to improve yield 
XX 

PS Example IB; Page 20-30; 54pp ; English. 
XX 

CC The coding sequence (AAT17235) of human transforming growth factor 

CC TGF-beta 1 (AAR92773) was cloned into plasmid pGEM-5ZF (+) (Promega) 

CC and the construct used to transform E. coli Y1090. Subcloning in 

CC pPLMu yielded plasmid pPLMu . hTGF-beta 1. Non-soluble, monomeric 

CC TGF-beta 1 was recovered from E. coli LC 137/pPLMu. hTGF-beta 1 (DSM 

CC 5656) transformants . A biologically active, dimeric form of 

CC TGF-beta 1 was obtd. by refolding this monomer in detergent - free 

CC buffer contg. DMSO and/or DMF. Dimers of TGF-beta 3 (AAR92772) and 

CC TGF-beta 2 (AAR92774), and hybrid dimers (see also AAR92775-77) , were 

CC also produced. 
XX 

SQ Sequence 339 BP; 66 A; 114 C; 100 G; 59 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 17; Length 339; 

Best Local Similarity 93.9%; Pred. No. 7.3e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 



Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I I I I I I I I I I i I I I I I ! I I ! I I I I I llllllllllllll llllllll lllll 

Db 30 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

lllllll II II Mill IIIIIIIMIilM llllllll llllllll lllll 

Db 90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 149 

Qy 174 CATCTGGAGCCTAGACACTC^GTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 23 3 

III llllllll lllll lllllll llllllllllll llllllllllllll lllll 

Db 150 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 20 9 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MMII I II 1 1 1 II 1 1 1 1 1 1 M Ml 1 1 Ml I M I M I I Ml 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

MMIII III II MMII MM III I I MM 1 1 II lllll Ml MUM llllllll 

Db 270 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

Qy 3 54 GTGCAGCTGA 363 

Illlllllll 
Db 330 GTGCAGCTGA 33 9 



RESULT 10 




AAX15245 




ID 


AAX15245 standard; cDNA ; 339 BP. 


XX 






AC 


AAX15245; 




XX 






DT 


20-MAR-2003 


(updated) 


DT 


28-APR-1999 


(first entry) 


XX 






DE 


cDNA encoding 


the mature form of transforming growth factor- 


XX 






KW 


Transforming growth factor-beta- 1 ; TGF-beta-1 ike protein; 


KW 


S-sulphonated TGF-beta-like protein; wound treatment; cancer 


KW 


bone repair; 


tissue repair; bone marrow protective agent; 


KW 


cardioprotect ion; ant i - inflammatory ; immunosuppressive; 


KW 


ulcer; bed sore; ds . 


XX 






OS 


Homo sapiens . 




XX 






PN 


EP891985-A1 . 




XX 






PD 


20-JAN-1999 . 




XX 






PF 


27-NOV-1990; 


98EP-0113487. 


XX 






PR 


06-DEC-1989; 


89GB-0027546. 


PR 


27-NOV-1990; 


90EP-0810922 . 


XX 






PA 


(NOVS ) NOVARTIS AG. 


XX 






PI 


Cerletti N, 


Cox D, McMaster GK, Meyhack B, Schmitz A; 


XX 







DR WPI; 1999-083520/08. 

DR P-PSDB; AAW97091. 
XX 

PT Producing biologically active dimeric Transforming Growth 

PT Factor-beta - by refolding new monomeric Transforming Growth 

PT Factor-beta, useful for treatment of wounds and cancer 
XX 

PS Example 1; Page 28; 32pp ; English. 
XX 

CC The present sequence encodes the mature form of transforming growth 

CC factor-beta-1 . Dimeric, biologically active TGF-beta-like protein 

CC can be produced by subjecting the denatured monomeric form to refolding 

CC conditions. The new monomeric S-sulphonated TGF-beta-like protein is 

CC useful for the production of the dimeric, biologically active 

CC TGF-beta-like protein, which is useful for the treatment of wounds 

CC (surface or internal) and cancer in a mammal, in bone and tissue 

CC repair, as a bone marrow protective agent, a mediator of 

CC cardioprotection, for the production of an ant i - inflammatory or 

CC immunosuppressive preparation. Treatment is useful for animals, 

CC especially humans, and wound treatment (e.g. ulcers, bed sores etc.) is 

CC particularly useful for the elderly. 

CC (Updated on 20-MAR-2003 to correct PF field.) 

CC (Updated on 20-MAR-2003 to correct PR field.) 

XX 

SQ Sequence 339 BP; 66 A; 114 C; 100 G; 59 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 20; Length 33 9; 

Best Local Similarity 93.9%; Pred. No. 7.3e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 



Db 




Qy 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 



Db 




Qy 



174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCT 233 



Db 




Qy 



234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 



Db 




Qy 



2 94 CTACGTGGGCCGC7^GCCCAAGGTGGAG(^GCTGTC(^(^TGATCGTGCGTTCCTGC^A 353 



Db 




Qy 



3 54 GTGCAGCTGA 363 



Db 




RESULT 11 
ABK84023 



ID ABK84023 standard; cDNA; 650 BP. 
XX 

AC ABK84 023; 
XX 

DT 14-AUG-2002 (first entry) 
XX 

DE Human cDNA differentially expressed in granulocytic cells #594. 
XX 

KW Human; ss; granulocytic cell; DNA chip; bacterial infection; 

KW viral infection; parasitic infection; protozoal infection; 

KW fungal infection; sterile inflammatory disease; psoriasis; 

KW rheumatoid arthritis; glomerulonephritis; asthma; thrombosis; 

KW cardiac reperfusion injury; renal reperfusion injury; ARDS; 

KW adult respiratory distress syndrome; inflammatory bowel disease; 

KW Crohn's disease; ulcerative colitis; periodontal disease; 

KW granulocyte activation; chronic inflammation; allergy. 

XX 

OS Homo sapiens . 
XX 

PN WO200228999-A2 . 
XX 

PD ll-APR-2002. 
XX 

PF 03-OCT-2001; 2 0 0 1WO-US3 082 1 . 
XX 

PR 03-OCT-2000; 2000US-237189P . 
XX 

PA (GENE-) GENE LOGIC INC. 
XX 

PI Beazer-Barclay Y, Weissman SM, Yamaga S, Vockley J; 
XX 

DR WPI; 2002-435328/46. 
XX 

PT Detecting granulocyte activation by detecting differential expression 

PT of genes associated with granulocyte activation, which serves as 

PT diagnostic markers that is useful for monitoring disease states and 

PT drug toxicity 
XX 

PS Claim 1; SEQ ID No 594; 114pp ; English. 
XX 

CC The invention relates to detecting (Ml) granulocyte (GC) activation 

CC (GCA) , by detecting the level of expression of gene(s) (Gs) identified by 

CC DNA chip analysis as given in the specification, and comparing 

CC the expression level to an expression level in an unactivated 

CC GC, where differential expression of Gs is indicative of GCA. 

CC Also included are modulating (M2) GA by contacting GC with an agent 

CC that alters the expression of at least one gene in Gs; (2) screening (M3) 

CC for an agent capable of modulating GCA or an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease using the 

CC gene expression profile; (3) detecting (M4) an inflammation (especially 

CC chronic) in a tissue, an allergic response in a subject, exposure of a 

CC subject to a pathogen or sterile inflammatory disease, by detecting the 

CC level of expression in a sample of the tissue of gene(s) from Gs, where 

CC the level of expression of the gene is indicative of inflammation; 

CC (4) treating (M5) an inflammation (especially chronic) or in a tissue, 

CC an allergic response in a subject, exposure of a subject to a pathogen 



CC or sterile inflammatory disease, by contacting a tissue having 

CC inflammation with an agent that modulates the expression of gene(s) 

CC from Gs in the tissue. Ml is useful for detecting GCA; M2 is useful for 

CC modulating GA; M3 is useful for screening an agent capable of modulating 

CC GCA preferably in an inflammation in a tissue; M4 is useful for 

CC detecting an inflammation (especially chronic) in a tissue, an allergic 

CC response in a subject, exposure of a subject to a pathogen or sterile 

CC inflammatory disease (e.g. psoriasis, rheumatoid arthritis, 

CC glomerulonephritis, asthma, thrombosis, cardiac reperfusion injury, renal 

CC reperfusion injury, ARDS, adult respiratory distress syndrome, 

CC inflammatory bowel disease, Crohn's disease, ulcerative colitis, 

CC periodontal disease; also bacterial infection, viral infection, 

CC parasitic infection, protozoal infection, fungal infection and M5 is 

CC useful for treating one of the above conditions. The present 

CC sequence represents a gene differentially expressed in granulocytes. 

CC Note: The sequence data for this patent did not form part 

CC of the printed specification, but was obtained in electronic 

CC format directly from WIPO at 

CC ftp . wipo . int/pub/publishedjpct_sequences . 

XX 

SQ Sequence 650 BP; 108 A; 244 C; 179 G; 119 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 24; Length 650; 

Best Local Similarity 93.9%; Pred. No. 8.2e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllllllllllllllll llllllllllllll llllllll Mill 

Db 30 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

IMIIIIII II II 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 f I f I r llllllll Lllllll Mill 

Db 90 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 

Qy 174 (^TCTGGAGCCTAGACACTCAGTACAACAAG^^ 233 

III llllllll MMI lllllll IMIIMMIM llllllllllllll Mill 

Db 150 GA.TTTGGAGCCTGGACACGC^GTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 209 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

Illlll I II I Ml M 1 1 III 1 1 1 1 1 1 ! II 1 1 1 1 II 1 1 1 M 1 1 : llllllllllllll 

Db 210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 269 

Qy 2 94 CTACGTGGGCCGC^GCCC^GGTGGAGCAGCTGTCO^CATGATCGTGCGTTCCTGCAA 3 53 

IMIMMM llllllll lllllll I'l III III Ml III IIIMIIIMI llllllll 

Db 270 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

Qy 3 54 GTGCAGCTGA 363 

Illlllllll 
Db 33 0 GTGCAGCTGA 33 9 



RESULT 12 
ABL68818 

ID ABL68818 standard; DNA; 650 BP. 
XX 

AC ABL68818; 
XX 



DT 15-MAY-2002 (first entry) 
XX 

DE Kidney cancer related gene sequence SEQ ID NO: 7155. 
XX 

KW Human; cancer; colon; breast; ovary; oesophagus; kidney; thyroid; 

KW stomach; lung; prostate; pancreas; carcinoma; antitumour; cancerous; 

KW cytostatic; gene therapy; antineoplastic; Wilm's tumour; adenocarcinoma; 

KW gene; ds . 

XX 

OS Homo sapiens. 
XX 

PN WO200194629-A2 . 
XX 

PD 13-DEC-2001. 
XX 

PF 30-MAY-2001; 2001WO-US10838 . 
XX 

PR 05-JUN-2000; 2000US-209473P . 

PR 05-JUN-2000; 2 0 OOUS - 2 0953 1 P . 

PR 18-SEP-2000; 2000US-233 133P . 

PR 18-SEP-2000; 2000US-233617P . 

PR 20-SEP-2000; 2000US-234009P . 

PR 20-SEP-2000; 2 0 0 OUS - 23 4 034 P . 

PR 20-SEP-2000; 2000US-234052P. 

PR 22-SEP-2000; 2000US-234509P . 

PR 22-SEP-2000; 2000US-234567P . 

PR 25-SEP-2000; 2 0 OOUS - 234 923 P . 

PR 25-SEP-2000; 2000US-234924P . 

PR 25-SEP-2000; 2000US-235077P . 

PR 25-SEP-2000; 2000US-235082P . 

PR 25-SEP-2000; 2000US-235134P . 

PR 25-SEP-2000; 2000US-235280P . 

PR 26-SEP-2000; 2000US-235637P . 

PR 26-SEP-2000; 2000US-235638P . 

PR 27-SEP-2000; 2 000US-23 571 1 P . 

PR 27-SEP-2000; 2000US-235720P . 

PR 27-SEP-2000; 2000US-235840P . 

PR 27-SEP-2000; 2000US-235863P . 

PR 28-SEP-2000; 2000US-236028P . 

PR 28-SEP-2000; 2000US-236032P . 

PR 28-SEP-2000; 2000US-236033P . 

PR 28-SEP-2000; 2000US-236034P . 

PR 28-SEP-2000; 2000US-236109P . 

PR 28-SEP-2000; 2000US-23611 IP . 

PR 29-SEP-2000; 2000US-236842P . 

PR 29-SEP-2000; 2000US-2368 91P . 

PR 02-OCT-2000; 2000US-237172P . 

PR 02-OCT-2000; 2 0 OOUS -23 7173P . 

PR 02-OCT-2000; 2000US-237278P . 

PR 02-OCT-2000; 2000US-237294P . 

PR 02-OCT-2000; 2000US-2372 95P . 

PR 02-OCT-2000; 2000US-2373 16P . 

PR 03-OCT-2000; 2000US-237425P . 

PR 03-OCT-2000; 2 0 00US-23 75 98 P . 

PR 03-OCT-2000; 2000US-237604P . 

PR 03-OCT-2000; 2000US-237606P . 

PR 03-OCT-2000; 2000US-237608P . 



PR 01-NOV-2000; 2 000US-244867P . 

PR Ol-NOV-2000; 2 0 OOUS - 24 5084 P . 
XX 

PA (AVAL-) AVALON PHARM . 
XX 

PI Young PE, Augustus M, Carter KC, Ebner R, Endress G, Horrigan S; 

PI Soppet DR, Weaver Z; 

XX 

DR WPI; 2002-188264/24. 
XX 

PT Screening for anti -neoplast ic agent involves exposing cells to a 

PT chemical agent to be tested for anti-neoplastic activity, and 

PT determining a change in expression of a gene of a signature gene set 

xx 

PS Claim 1; SEQ ID 7155; 44pp ; English. 
XX 

CC The present invention describes a method (Ml) for screening for an 

CC anti-neoplastic agent. The method involves exposing cells to a chemical 

CC agent to be tested for anti-neoplastic activity, determining a change in 

CC expression of at least one gene (I) of a signature gene set, where (I) 

CC comprises a sequence (S) selected from 8447 sequences (given in ABL61664 

CC to ABL70110), or is at least 95% identical to (S) , where a change in 

CC expression is indicative of anti-neoplastic activity. (I) has cytostatic 

CC activity and can be used in gene therapy. Ml can be used for screening 

CC an anti-neoplastic agent, and can be used for producing a product which 

CC is the data collected with respect to the anti-neoplastic agent as a 

CC result of Ml, and the data is sufficient to convey the chemical 

CC structure and/or properties of the agent. Ml can be used in the 

CC treatment of cancer such as colon, breast, stomach, lung, thyroid, 

CC oesophageal, ovarian, kidney, prostate or pancreatic cancer, 

CC adenocarcinoma, carcinoma, clear cell cancer, infiltrating ductal cancer, 

CC infiltrating lobular cancer, squamous cell carcinoma, neuroendocrine 

CC carcinoma, papillary carcinoma and Wilm's tumour. 

XX 

SQ Sequence 650 BP; 108 A; 244 C; 179 G; 119 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 24; Length 650; 

Best Local Similarity 93.9%; Pred . No. 8.2e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 



Db 




Qy 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 



Db 




Qy 



174 CATCTGGAGCC IAGACAC i CAGl ACAALAAGGlCCl'GGL'l'Cl'GlACAACCAGCACAACCC 233 



Db 




Qy 



234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 



Db 




Qy 



2 94 CTACGTGGGCCG(^GCCCAAGGTGGAG(^GCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 



I II : I I I II I I I : Ml I I I r I I i I 

Db 27 0 CTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCC^CATGATCGTGCGCTCCTGCAA 32 9 

Qy 3 54 GTGCAGCTGA 363 

Illlllllll 
Db 33 0 GTGCAGCTGA 33 9 

RESULT 13 
ABZ35738 

ID ABZ35738 standard; DNA; 1176 BP. 
XX 

AC ABZ3 5738; 
XX 

DT 07-FEB-2003 (first entry) 
XX 

DE Human TGF beta 1 polynucleotide SEQ ID NO 46. 
XX 

KW Double stranded RNA; dsRNA; RNAi ; RNA inhibition; cytostatic; virucide; 

KW protozoacide; gene expression; antisense; tumour; infection; Plasmodium; 

KW virus; viroid; anti-GFP; human; HIV; human immunodeficiency virus; 

KW Hepatitis C virus; human papilloma virus; gene; ds . 
XX 

OS Homo sapiens. 
XX 

PN DE10100588-A1 . 
XX 

PD 18-JUL-2002, 
XX 

PF 09-JAN-2001; 2001DE- 1000588 . 
XX 

PR 09-JAN-2001; 2001DE-1000588 . 
XX 

PA (RIBO-) RIBOPHARMA AG. 
XX 

PI Kreutzer R, Limmer S, Rost S, Hadwiger P; 
XX 

DR WPI; 2002-683450/74. 
XX 

PT Inhibiting expression of target genes, useful e.g. for treating tumors, 

PT by introducing into cells two double-stranded RNAs that are 

PT complementary to the target 
XX 

PS Claim 13; Page 34-35; lOOpp; German. 
XX 

CC The invention relates to inhibiting expression of a target gene in a cell 

CC by introducing at least two oligoribonucleotides (dsRNAI and II) , both 

CC with a double-stranded (ds) structure of at most 49 sequential nucleotide 

CC pairs. At least part of one strand (SI , S2) of the ds structures in each 

CC of dsRNAI and II are complementary to regions in the target gene. The 

CC method uses antisense inhibition of gene expression using double stranded 

CC RNA inhibition (RNAi) . The method is particularly used to treat tumours 

CC or infections, especially by Plasmodium or viruses/viroids (pathogenic on 

CC humans, animals or plants) . The method provides more effective inhibition 

CC of expression than known methods using a single dsRNA, even at very low 

CC concentrations. When dsRNA has at least one unpaired nucleotide at the 

CC end, stability (and thus effective concentration in the cell) is 



CC improved and efficiency can be increased further by pretreating the cells 

CC with interferon. The present sequence is that of a target DNA of the 

CC invention. 
XX 

SQ Sequence 1176 BP; 239 A; 382 C; 353 G; 202 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 24; Length 1176; 

Best Local Similarity 93.9%; Pred. No. 9.1e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllillllMIIIIIII llllllllllllll llllllll Mill 

Db 867 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 92 6 

Qy 114 GAAGTGGATTC^TGAACCC^GGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINI. II II II I i ! 1 1 1 1 1 1 1 1 1 1 1 MINIM llllllll Mill 

Db 927 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 98 6 

Qy 174 CATCTGGAGCCTAGACACTC^GTACAACA^ 23 3 

III llllllll Mill IIMIII MIMIMIIM I M 1 1 1 1 i 1 1 ! 1 1 1 Mill 

Db 98 7 CATTTGGAGCCTGGACACGCAGTACAGC^GGTCCTGGCCCTGTAC^Ca^GCATAACCC 104 6 

Qy 234 GGG CG CGTCGG CGG CG CCGTG CTGCGTG C CG CAGG CG CTGGAG CCA CTG CCCAT CGTGTA 2 93 

Illlll 1 1 1 1 1 1 1 M II I II I II I II I II 1 1 1 1 1 II 1 1 1 1 1 II I llllllllllllll 

Db 104 7 GGG CG C CTCGG CGG CG C CGTG CTG CGTGCCG CAGG CG CTGGAG CCG CTG CCCAT CGTGTA 1106 

Qy 2 94 CTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

I I I I I I I I I I I M I I I I I I I II I I I I I I I I I I I I I I I I [ I I I I I I I I I I I I llllllll 
Db 1107 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1166 

Qy 3 54 GTGCAGCTGA 3 63 

Illlllllll 
Db 1167 GTGCAGCTGA 1176 



RESULT 14 
ABX09981 

ID ABX09981 standard; DNA; 1176 BP. 
XX 

AC ABX09981; 
XX 

DT 23-JAN-2003 (first entry) 
XX 

DE Human TGFbetal DNA fragment SEQ ID 46. 
XX 

KW Oligoribonucleotide; interferon; oncogene; cytokine; Id; developmental; 

KW prion; inhibition; human; ds . 

XX 

OS Homo sapiens. 
XX 

PN DE10100587-C1 . 
XX 

PD 21-NOV-2002. 
XX 

PF 09-JAN-2001; 2001DE-1000587. 
XX 

PR 09-JAN-2001; 2001DE-1000587. 



XX 

PA (RIBO-) RIBOPHARMA AG. 
XX 

PI Kreutzer R, Limmer S, Rost S, Hadwiger P; 
XX 

DR WPI; 2002-742209/81. 
XX 

PT Inhibiting expression of target genes, e.g. oncogenes, in cells, by 

PT introduction of complementary double-stranded ol igoribonucleot ide, 

PT after treating the cell with interferon 
XX 

PS Disclosure; Page 3 9-40; 98pp; German. 
XX 

CC This invention describes a novel method for inhibiting expression of a 

CC target gene by introducing into the cell that contains the target gene 

CC at least one oligoribonucleotide (dsRNAI) that has a double-stranded 

CC (ds) structure of not more than 4 9 consecutive nucleotides (nt) , where 

CC at least a segment of one strand of the ds structure is complementary 

CC with the target gene and the cells are treated with interferon before 

CC introduction of dsRNAI . The method is used to inhibit expression of 

CC target genes, particularly oncogenes, cytokine genes, Id (not defined) 

CC protein genes; developmental or prion genes, or genes expressed in 

CC pathogenic organisms (particularly Plasmodia) or in viruses or viroids 

CC (pathogenic in humans, animals or plants) . Treating the cells with 

CC interferon greatly increases the extent to which dsRNA can inhibit 

CC expression of the target genes, and the effect is even greater when dsRNA 

CC are modified to increase their stability. ABX09936-ABX10075 represent 

CC gene fragments used to illustrate the method of the invention. 

XX 

SQ Sequence 1176 BP; 239 A; 382 C; 353 G; 202 T; 0 other; 



Query Match 77.0%; Score 279.6; DB 24; Length 1176; 

Best Local Similarity 93.9%; Pred. No. 9.1e-58; 

Matches 2 91; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 



Qy 


54 


CAAGG AGAAGAA CTG CTG CGTG CGG CAG CTCTA CATTG A CTTC CGG AAGGAC CTGGG CTG 

II lllllllllllllllllllllllllll IIIIIIIIIIIIII IIIIIIII Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 


113 


Db 


867 


926 


Qy 
Db 


114 
927 


GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

Illllllll II II llllllllllllllllllll IIIIIIII IIIIIIII Mill 

GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 


173 
986 


Qy 


174 


CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 

III IIIIIIII Mill lllllll llllllllllll IIIIIIIIIIIIII Mill 
C^TTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC^GCATAACCC 


233 


Db 


987 


1046 


Qy 


234 


GGG CG CGT CGG CGG CGCCGTG CTG CGTG CCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

Mill, IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII IIIIIIIIIIIIII 

GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 


293 


Db 


1047 


1106 


Qy 


294 


CTACGTGGGCCGC^GCCC^GGTGGAGC^GCTGTCCAAa^TGATCGTGCGTTCCTGCAA 

1 M 1 II 1 1 1 1 1 1 1 1 1 ! M 1 II 1 1 1 1 1 h II ! II 1 1 i:m 1 1 1 1 1 1 1 1 III 1 iiiiiiii 

CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 


353 


Db 


1107 


1166 


Qy 


354 


GTGCAGCTGA 3 63 





1 1 1 1 1 1 J 1 1 1 



Db 1167 GTGCAGCTGA 1176 



RESULT 15 
ABV78162 

ID ABV78162 standard; DNA; 1176 BP. 
XX 

AC ABV78162; 
XX 

DT 15-NOV-2002 (first entry) 
XX 

DE Human TGF beta 1 DNA SEQ ID NO 46. 
XX 

KW RNA inhibition; dsRNAl; gene expression inhibitor; oncogene; cytostatic; 

KW virucide; protozoacide ; gene; ds . 

XX 

OS Homo sapiens. 
XX 

PN WO200255693-A2 . 
XX 

PD 18-JUL-2002. 
XX 

PF 09-JAN-2002; 2 002WO-EP00152 . 
XX 

PR 09-JAN-2001; 2001DE-1000586 . 

PR 26-OCT-2001; 2001DE-1055280 . 

PR 29-NOV-2001; 2 001DE- 10584 11 . 

PR 07-DEC-2001; 2 00 IDE- 1 060 1 5 1 . 
XX 

PA (RIBO-) RIBOPHARMA AG. 
XX 

PI Kreutzer R, Limmer S, Rost S, Hadwiger P; 
XX 

DR WPI; 2002-590671/63. 
XX 

PT Inhibiting expression of target gene, useful e.g. for inhibiting 

PT oncogenes, by administering double- stranded RNA complementary to the 

PT target and having an overhang 

XX 

PS Claim 10; Page 138; 203pp ; German. 
XX 

CC The invention relates to inhibiting expression of a target gene (I) in a 

CC cell by introducing an inhibitory RNA (dsRNAl) having a double- stranded 

CC structure of at most 49 consecutive bases. At least part of one strand 

CC (asl) of dsRNAl is complementary to (I) and at least one end of dsRNAl 

CC has an overhang of 1-4 nucleotides. The method is used to inhibit the 

CC expression of a wide range of genes, e.g. oncogenes, cytokine genes etc. 

CC in humans, also genes in Plasmodium or in viruses or viroids that are 

CC pathogenic for humans, animals or plants. Introducing an overhang into 

CC dsRNA greatly increases effectiveness for inhibiting gene expression, 

CC both in vivo and in vitro and also increases stability and thus the 

CC effective concentration inside the cell. The present sequence is that of 

CC a gene related to the invention. 
XX 

SQ Sequence 1176 BP; 239 A; 382 C; 353 G; 202 T; 0 other; 

Query Match 77.0%; Score 279.6; DB 24; Length 1176; 



Best Local Similarity 93.9%; Pred . No. 9.1e-58; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 



Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II MIMIIMIMII II Illill II 1 1 1 1 1 Ml III 1 1 1 MINIM Mill 

Db 867 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 926 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI II II I IMIMIII IMIIMI IIIIIIM 1 1 1 1 M 1 1 Mill 

Db 927 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 98 6 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

III IIIIIIM Mill Mllll MIMIMIMI II 1 1 1 1 1 1 1 1 1 1 M Mill 

Db 987 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 104 6 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

Mllll llllllllllllllllllllllllllllllllllllll I 

Db 104 7 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 1106 

Qy 294 CTACGTGGGCCGC^GCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

M M 1 1 III 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M I II 1 1 M M I III I il II M 1 1 IIIIIIM 

Db 1107 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1166 

Qy 354 GTGCAGCTGA 363 

MINIUM 
Db 1167 GTGCAGCTGA 1176 



Search completed: October 27, 2003, 19:11:03 
Job time : 110.057 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 18:35:27 ; Search time 107.894 Seconds 

(without alignments) 
9022.658 Million cell updates/sec 

Title: US-10-017-372E-14 
Perfect score: 363 

Sequence : 1 gccctggataccaactactg gttcctgcaagtgcagctga 3 63 

Scoring table: I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 1792395 seqs, 1340900451 residues 

Total number of hits satisfying chosen parameters: 3584790 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 



Listing first 45 summaries 



Database : Published_Applications_NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB . seq : * 

2 : / cgn2 _6 /p t oda t a / 2 /pubpna / PCT_NEW_PUB .seq:* 

3 : /cgn2__6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq: * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq: * 

6: / cgn2_6/ptodat a/2 /pubpna /PCTUS_PUBCOMB. seq: * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9 : /cgn2_6/ptodata/2/pubpna/US09A__PUBCOMB. seq: * 
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB. seq: * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB .seq: * 
12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB. seq: * 
13 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB . seq : * 
14: / cgn2_6 /p t oda ta / 2 /pubpna /US 1 0B_PUBCOMB . s eq : * 
15 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 
16 : /cgn2__6/ptodata/2/pubpna/US60_NEW_PUB. seq: * 
17: / egn2_6 /p t oda t a / 2 /pubpna /US 6 0_PUBCOMB . s eq : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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c 
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33 
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8 
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10 
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34 


45 
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12 


5 
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14 
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9 pm i pn hp 3 9 1 Ad 


35 


45 
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12 


5 
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14 
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36 
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12 


4 
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9 
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37 




45 


12 


4 
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10 
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45 


12 


4 
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12 
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39 


44 


.2 


12 


2 
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9 
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40 


44 


.2 
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2 
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9 
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41 


44 


.2 


12 


2 
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12 
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Sequence 1, Appli 


42 


44 


.2 


12 


2 
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9 
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Sequence 7, Appli 


43 


44 


.2 


12 


2 
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12 
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Sequence 7 , Appli 


44 


44 


.2 


12 


2 
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9 
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Sequence 256, App 


45 


44 


.2 


12 


2 
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9 


US-09-765-527-260 


Sequence 260, App 



ALIGNMENTS 



RESULT 1 

US-09-911-904-167 

Sequence 167, Application US/09911904 
Publication No. US20030096234A1 
GENERAL INFORMATION: 
APPLICANT : Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Nef t , Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER: US/ 09/ 9 1 1 , 9 04 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 386 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH: 489 
TYPE: DNA 

ORGANISM: Canis familiaris 
FEATURE : 

NAME /KEY : misc_f eature 
LOCATION: (1) . . . (489) 
OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 



Query Match 77.5%; Score 281.2; DB 11; Length 489; 

Best Local Similarity 94.2%; Pred. No. 9e-74; 

Matches 2 92; Conservative 0; Mismatches 18; Indels 0; 



Gaps 



0; 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II IMIIIIIIIIIMIIII lllllllllllllllllllllll Mill llllllll 

101 CACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTG 160 



Qy 


114 


Db 


161 


Qy 


174 


Db 


221 


Qy 


234 


Db 


281 


Qy 


294 


Db 


341 


Qy 


354 


Db 


401 



GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

IIMIIIII Mill Mill Mill 1 1 II IIIIMIIIIIIIIMI Mill 

GAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTA 22 0 
CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

Ml Ml I Mill MIMM 1 1 1 M 1 1 F f 1 1 1 llllllllllllllllllll 

C^TTTGGAGCCTGGAC^CGCAGTAC^GC^GGTCCTGGCCCTGTAC^CCAGCACAACCC 2 8 0 
GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 34 0 
CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

1 1 1 ' 1 1 1 1 1 1 1 : 1 1 1 1 1 : 1 1 ! 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 II 1 .1111! I llllllll 

CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCTGCAA 4 00 



Illlllllll 



RESULT 2 

US-09-813-271B-1 

; Sequence 1, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834Alart is Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/0 9/8 13 , 27 IB 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02 7 1 9 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25 -Jul -94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4 -2 003 9C/C1C1 /USN 
TELECOMMUNICATION INFORMATION: 



TELEPHONE: (908) 522-6940 
TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 339 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu . hTGF-betal (DSM 5656) 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1 . .336 

OTHER INFORMATION: /product = "human TGF-betal" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 77.0%; Score 279.6; DB 10; Length 339; 

Best Local Similarity 93.9%; Pred . No. 2.5e-73; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllllllllllllllll llllllllllllll llllllll Mill 
CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Illllllll II II llllllllllllllllllll llllllll llllllll Mill 

GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 
C^TCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACC^GCAC^CCC 233 

III llllllll Mill IIIIIII MIMIIMIM llllllllllllll Mill 

CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 2 0 9 
GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

IMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIUM 

GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 
CTACGTGGGCCGCAAGCCC^GGTGGAGC^GCTGTCCAAC^TGATCGTGCGTTCCTGCAA 353 

1 1 M I II 1 1 M 1 1 1 II I II 1 1 II 1 1 1 1 1 1 1 1 II I II 1 1 1 1 M 1 1 1 1 II 1 1 1 llllllll 

CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 9 

GTGCAGCTGA 3 63 
Illlllllll 



Qy 


54 


Db 


30 


Qy 


114 


Db 


90 


Qy 


174 


Db 


150 


Qy 


234 


Db 


210 


Qy 


294 


Db 


270 


Qy 


354 


Db 


330 



RESULT 3 
US-10-087-268-1 

; Sequence 1, Application US/10087268 

; Publication No. US20030119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 



FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER: US/10/087,268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 1 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 
NAME /KEY: 5 ' UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME /KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY: sig_peptide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME/ KEY: 3' UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-1 

Query Match 77.0%; Score 279.6; DB 14; Length 1821; 

Best Local Similarity 93.9%; Pred. No. 3.5e-73; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 1 1 1 1 1 1 1 Ml II I II M I il I II I 1 1 1 1 1 II 1 1 II 1 1 1 MINIM Mill 

Db 1375 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 1434 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIM M II IN II M III I II I II M 1 1 MINIM MINIM INN 

Db 143 5 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 94 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAA 233 

III I Mill I INN MM NINIIIMM II 1 1 1 1 1 1 1 1 1 1 1 1 INN 

Db 14 95 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 1554 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MINI II 1 1 II I N N 1 1 1 1 1 1 1 1 II 1 1 1 1 II I II I III II I 1 1 1 1 1 1 1 1 1 1 1 1 1 f 

Db 1555 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 1614 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

II 1 1 M I II I II I M I N I N 1 1 II II I Ml 1 1 1 1 II I III II I II 1 1 II MINIM 

Db 1615 CTACGTGGGCCG(^GCC(^GGTGGAGCAGCTGTC(^ACATGATCGTGCGCTCCTGCAA 1674 

Qy 354 GTGCAGCTGA 363 

INN III 

Db 1675 GTGCAGCTGA 1684 



RESULT 4 
US-10-087-268-4 

; Sequence 4, Application US/10087268 
; Publication No. US20030119010A1 



GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER : US/ 1 0/ 08 7 , 2 68 
CURRENT FILING DATE: 2 002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 4 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME/ KEY: 5 ' UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME /KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY : s ig_pept ide 
LOCATION: (512) .. (598) 
OTHER INFORMATION: 
NAME/ KEY: 3 ' UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-4 

Query Match 77.0%; Score 279.6; DB 14; Length 1821; 

Best Local Similarity 93.9%; Pred . No. 3.5e-73; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II llllllllllillllllllllllllll Mllllllllllll llllllil Mill 

Db 1375 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 1434 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I Mill II II I 1 1 1 1 1 1 ' M 1 1 1 1 1 1 1 1 1 llllllil II I M M I Mill 

Db 143 5 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 94 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGG 233 

III llllllil Mill MIMM llllllllllll 1 1 . 1 1 1 1 1 1 1 ! 1 1 1 Mill 

Db 14 95 CATTTGGAGCCTGGACACGCAGTACAGC^GGTCCTGGCCCTGTACAACCAGCATAACCC 1554 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

Ml MMMIIMIMIMMMMMMMIIMMIMI 1 1 1 1 1 1 1 1 1 1 ! 1 1 

Db 1555 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 1614 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

I I I M I II II M I II I I I II II I II II II II II II II I I || I II II II I II llllllil 

Db 1615 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 1674 

Qy 354 GTGCAGCTGA 363 

II II II II II 

Db 1675 GTGCAGCTGA 1684 



RESULT 5 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi , Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: Til linghast , John 
APPLICANT: Drmanac, Rado]e T. 

TITLE OF INVENTION: No. US2 003 0104529Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/037,270 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER : 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 220 
LENGTH: 2742 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (842) . . (2014) 
US-10-037-270-220 



Query Match 77.0%; Score 279.6; DB 14; Length 2742; 

Best Local Similarity 93.9%; Pred . No. 3.9e-73; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

M 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 , II I M llllllllllllll llllllll Mill 

Db 17 05 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 1764 

Qy 114 GAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI II II MIMIIMMIIIIIIIII llllllll llllllll Mill 

Db 1765 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 1824 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCT 233 

Ml MMMM Mill 1 1 1 1 1 1 1 IMMIMMM llllllllllllll Mill 



Db 



1825 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC^ 1884 



Qy 2 34 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

MM II llill'IIIIIIIIIIIIIIMIIIIII II III llllllllllllll 

Db 188 5 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 1944 

Qy 294 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

llllllll Mill IMIIIIIIIIIMMIIIIMMI Mlllllll llllllll 

Db 194 5 CTACGTGGGCCGC^AGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 2004 

Qy 3 54 GTGCAGCTGA 3 63 

llllllll 
Db 2 0 05 GTGCAGCTGA 2 014 



RESULT 6 

US-09-948-002-28 

Sequence 28, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/ 09/ 94 8 , 002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 28 
LENGTH: 2745 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (842) . . . (2017) 
US-09-948-002-28 

Query Match 77.0%; Score 279.6; DB 11; Length 2745; 

Best Local Similarity 93.9%; Pred. No. 3.9e-73; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II lllllllllllllllllllllllllll llllllllllllll llllllll Mill 
Db 17 08 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 1767 

Qy 114 GAAGTGGATTC^TGAACCC^AGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Mlllllll II II MIMMMIIIIIIIIIII MINIM llllllll Mill 

Db 1768 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 1827 

Qy 174 C^TCTGGAGCCTAGAC^CTCAGTACAAGAAGGTCCTGGCTC 233 

Ml llllllll Mill IIIIIM llllllllllll llllllllllllll Mill 

Db 1828 CATTTGGAGCCTGGAC^CGC^GTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 188 7 



Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

MINI llllllll II IMIIIIIIIIIIIIIIIMIIIIII 1 1 1 M I II I Mill 

Db 18 88 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 194 7 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

1 1 1 1 1 1; I II 1 1 1 1 1 1 1 1 M li 1 1 1 II M Ml 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 MMMM 

Db 194 8 CTACGTGGGCCGC^AGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 2 007 

Qy 3 54 GTGCAGCTGA 363 

MMMMM 

Db 2 008 GTGCAGCTGA 2 017 



RESULT 7 
US-09-948-002-1 

; Sequence 1, Application US/09948002 

; Publication No. US20030050265A1 

; GENERAL INFORMATION: 

; APPLICANT: Nicholas M . Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
/ TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/94 8,002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 1 

LENGTH: 2 0 94 

TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (868) ... (2040) 
US-09-948-002-1 

Query Match 70.9%; Score 257.2; DB 11; Length 2094; 

Best Local Similarity 89.4%; Pred. No. 1.6e-66; 

Matches 277; Conservative 0; Mismatches 33; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I IIIIIMIIIIII Mill III lllllllllll Ml, II II 

Db 1731 CACAGAGAAGAA CTG CTG TGTG CGG CAG CTGTA CATTGA CTTTAGGAAGGA C CTGGGTTG 1790 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M II 1 1 Ml 1 1 M MIM IMM I ! 1 1 1 MMI 

Db 17 91 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGACCCTGCCCCTA 18 5 0 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTC 233 

1 1 MMMM MIM MMMI MMMMI M M MMMM MMMM 

Db 18 51 TATTTGGAGCCTGGACACACAGTAC^GCAAGGTCCTTGCCCTCTAO^CCAACACAACCC 1910 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

I M M I MMM I MIMMMMMMMMI MMMMMMMMMI 1 1 



Db 



1911 GGGCGCTTCGGCGTCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCATCGTCTA 197 0 



Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

Illllllll IIIIIMIIIMIIMIIMI llillllllllll Mill Illlllll 
Db 1971 CTACGTGGGTCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATTGTGCGCTCCTGCAA 2030 

Qy 3 54 GTGCAGCTGA 3 63 

IIIIIIIIM 
Db 2 031 GTGCAGCTGA 2 04 0 

RESULT 8 

US-09-948-002-27 

Sequence 27, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/948, 002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 27 
LENGTH: 1585 
TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (413) . . . (1585) 
US-09-948-002-27 

Query Match 70.4%; Score 255.6; DB 11; Length 1585; 

Best Local Similarity 89.0%; Pred. No. 4.6e-66; 

Matches 276; Conservative 0; Mismatches 34; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II IIIIIIIIM ' 1 1 II IIIMIII MIMMIMI llillllllllll II 

Db 1276 CACAGAGAAGAACTGCTGTGTACGGCAGCTGTACATTGACTTTAGGAAGGACCTGGGTTG 133 5 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Illllllll II II llllllllllllllllllll Mill IIIIIIIIMI Mill 

Db 1336 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGGCCCTGCCCCTA 13 95 

Qy 174 C^TCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAA 233 

Ml Illlllll Mill lllllll Illllllll II II Illlllll Illlllll 

Db 13 96 CATTTGGAGCCTGGACACAC^GTACAGCAAC^ 14 55 

Qy 234 GGGCG CGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

Ml II II II I 1 1 1 1 1 ' I U N 1 1 1 II I II lllllllllllllllllll II 

Db 14 56 GGGTGCTTCCGCATCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCATCGTCTA 1515 



Qy 2 94 CTACGTGGGCCG(^GCC(^GGTGGAGC^GCTGTCa^(^TGATCGTGCGTTCCTG(^ 3 53 

Illllllll lllllllllllllllllllll lllllllllllllllllll llllllll 
Db 1516 CTACGTGGGTCGC^AGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAA 1575 

Qy 3 54 GTGCAGCTGA 363 

Illlllllll 
Db 1576 GTGCAGCTGA 158 5 



RESULT 9 

US-09-813-271B-7 

; Sequence 7 , Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834Alartis Patent Department 
; STREET: 5 64 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY /AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/DOCKET NUMBER: 4 -2003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 
; DESCRIPTION: /desc = "recombinant hybrid DNA of 

IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.TGF-betal (44/4 5) beta3 
FEATURE : 



NAME/ KEY : mat_peptide 
LOCATION: 1..132 

OTHER INFORMATION: /product^ "N- terminal 44 amino 
acids of human TGF-betal" 
FEATURE : 

NAME/ KEY: mat_pept ide 
LOCATION: 133 . .336 

OTHER INFORMATION: /product= "C-terminal 68 amino 
acids of human TGF-beta3" 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product = "hybrid TGF-beta named 
TGF-betal-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 

Query Match 54.6%; Score 198.2; DB 10; Length 336; 

Best Local Similarity 77.9%; Pred. No. 3.7e-49; 

Matches 239; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 

Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I I I I I I I Ml 1. i I I I I I II I I M I IIIIIMMIIIM llllllll Mill 

Db 3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

Ml Mill II II 1 1 M II I II M Ml 1 1 II M MINI II II II 1 1 II 

Db 90 GAAGTGGATCC^CGAGCCC^GGGCTACC^TGCCAACTTCTGCTCAGGCCCTTGCCCATA 14 9 

Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCT 233 

I II I II MM Ml II Ml 1 1 1 1 llllllll I Mill 

Db 150 CCTCCGCAGTGCAGACACAACCCAC^GCACGGTGCTGGGACTGTACAACACTCTGAACCC 209 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

I II II II 1 1 1 1 Mill Mill 1 1 1 1 llllllll III Mill 1 1 1 1 

Db 210 TGAAGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCTGTA 26 9 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

III II II II Mill 1 1 1 1 1 1 M II I 1 1 1 r I E 1 1 1 I III II 1 1 II 

Db 27 0 CTATGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAACATGGTGGTGAAGTCTTGTAA 32 9 

Qy 354 GTGCAGC 3 60 

II III 

Db 33 0 ATGTAGC 336 



RESULT 10 
US-09-813-271B-5 

; Sequence 5, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION : 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES : 13 
CORRESPONDENCE ADDRESS: 



ADDRESSEE: No. US20020115834Alartis Patent Department 
STREET: 564 Morris Avenue 
CITY: Summit 
STATE: New Jersey 

COUNTRY: USA 
ZIP : 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 8 13 , 2 7 IB 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4 -2 0 03 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 339 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.hTGF-beta3 (DSM 5658) 
FEATURE : 

NAME/ KEY: CDS 

LOCATION: 1. .336 

OTHER INFORMATION: /product^ "human TGF-beta3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-813-271B-5 

Query Match 52.7%; Score 191.4; DB 10; Length 339; 

Best Local Similarity 75.7%; Pred. No. 3.8e-47 ; 

Matches 237; Conservative 0; Mismatches 76; Indels 0; Gaps 0; 
Qy 51 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 110 



Db 



27 




86 



111 



CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 



170 



Db 



87 




146 



171 



CTACATCTGGAGCCTAGACACTCAGTAC^O^GGTCCTGGCTCTGTACAACCAGCACAA 



230 



Db 



147 




206 



Qy 231 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 290 

II! I II II II 1 1 1 1 lllllllllll 1 1 1 1 llllllll III Mil! I 

Db 2 07 CCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCT 266 

Qy 2 91 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 350 

MINI MM II lllll lllllllllll IIIIIIIII I Ml II II 

Db 267 GTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAACATGGTGGTGAAGTCTTG 32 6 

Qy 3 51 CAAGTGCAGCTGA 363 

II II llllll 
Db 327 TAAATGTAG CTG A 33 9 



RESULT 11 
US-09-906-158-3 

Sequence 3, Application US/09906158 
Publication No. US20030078217A1 
GENERAL INFORMATION: 
APPLICANT: Brett P. Monia 
APPLICANT: Susan M. Freier 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

FILE REFERENCE: RTS-0257 

CURRENT APPLICATION NUMBER: US/09/906,158 
CURRENT FILING DATE: 2001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 3 
LENGTH: 2574 
TYPE : DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (254) . . . (1492) 
US-09-906-158-3 

Query Match 52.7%; Score 191.4; DB 11; Length 2574; 

Best Local Similarity 75.7%; Pred. No. 5.9e-47; 

Matches 237; Conservative 0; Mismatches 76; Indels 0; Gaps 0; 
Qy 51 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 110 

I II MM llllllllll Mill I 1 1 i 1 1 ! 1 1 ! ! 1 1 1 1 ! 1 1 MM Mill 

Db 118 0 CAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCTACATTGACTTCCGACAGGATCTGGG 123 9 

Qy 111 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 170 

llllllllll I MMM MIIIMM IIIMM llllll II II II II 

Db 124 0 CTGGAAGTGGGTCCATGAACCTAAGGGCTACTATGCCAACTTCTGCTCAGGCCCTTGCCC 12 9 9 

Qy 171 CTACATCTGGAGCCTAGACACT(^GTA(^CAAGGTCCTGGCTCTGTAC^CCAGCAa^ 23 0 

III II I II MMM III II Ml MM MIIIMM I II 

Db 13 00 ATACCTCCGC^GTGC^GA(^CAACCC^C^G(^CGGTGCTGGGACTGTACAACACTCTGAA 1359 

Qy 231 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 2 90 

III I II II M MM lllllllllll MM llllllll Ml Mill I 

Db 13 60 CCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCT 1419 



Qy 



2 91 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 3 50 



Db 



1420 




Qy 



351 



CAAGTGCAG CTGA 3 63 



Db 



1480 




RESULT 12 
US-10-028-158-20 

; Sequence 20, Application US/10028158 
; Publication No. US20020110833A1 
; GENERAL INFORMATION: 

APPLICANT: Caniggia, Isabella 
; APPLICANT: Post, Martin 
; APPLICANT: Lye, Stephen 

; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF 

; TITLE OF INVENTION: TROPHOBLAST 

; FILE REFERENCE: 11757. 38USWO 

; CURRENT APPLICATION NUMBER: US/ 10/ 028 , 158 

; CURRENT FILING DATE: 2001-12-20 

; PRIOR APPLICATION NUMBER: US/0 9/3 8 0 , 662 

; PRIOR FILING DATE: 1999-12-21 

; PRIOR APPLICATION NUMBER: PCT/CA98/00180 

; PRIOR FILING DATE: 1998-03-05 

PRIOR APPLICATION NUMBER: US 60/039,919 

PRIOR FILING DATE: 1997-03-07 
; NUMBER OF SEQ ID NOS : 24 
; SOFTWARE: Patent In version 3.0 
; SEQ ID NO 2 0 

LENGTH: 2 574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (254) . . (1492) 
US-10-028-158-20 

Query Match 52.7%; Score 191.4; DB 13; Length 2574; 

Best Local Similarity 75.7%; Pred. No. 5.9e-47; 

Matches 237; Conservative 0; Mismatches 76; Indels 0; Gaps 0; 
Qy 51 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 110 



Db 



1180 




Qy 



111 



CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 17 0 



Db 



1240 




Qy 



171 



CTACATCTGGAGCCTAGACACTC^GTACAACAAGGT 23 0 



Db 



1300 




Qy 



231 



CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 2 90 



Db 



1360 




Qy 2 91 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 3 50 

llllll II M II Mill lllllllllll lllllllll I III II II 
Db 1420 GTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAACATGGTGGTGAAGTCTTG 14 79 

Qy 3 51 CAAGTG CAG CTGA 363 

II II llllll 
Db 14 8 0 TAAATGTAGCTGA 14 92 



RESULT 13 
US-09-906-158-10 

Sequence 10, Application US/09906158 
Publication No. US20030078217A1 
GENERAL INFORMATION: 
APPLICANT: Brett P. Monia 
APPLICANT: Susan M. Freier 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

FILE REFERENCE: RTS-0257 

CURRENT APPLICATION NUMBER: US/09/906,158 
CURRENT FILING DATE: 2001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 10 
LENGTH: 2879 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (611) . . . (1843) 
US-09-906-158-10 

Query Match 51.8%; Score 188.2; DB 11; Length 2879; 

Best Local Similarity 75.1%; Pred. No. 5.4e-46; 

Matches 235; Conservative 0; Mismatches 78; Indels 0; Gaps 0; 

Qy 51 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 110 

I M I I I I llllllllll II II I II II llllllllllll I I I I II II 
Db 1531 CAACCTGGAGGAGAACTGCTGTGTACGCCCCCTTTATATTGACTTCCGGCAGGATCTAGG 1590 

Qy 111 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 170 

llllll III I II Mill Mill III lllllll llllll II II II II 

Db 1591 CTGGAAATGGGTCCACGAACCTAAGGGTTACTATGCCAACTTCTGCTCAGGCCCTTGCCC 165 0 

Qy 171 CTA(^TCTGGAGCCTAGA(^CTCAGTAC^Q^AGGTCCTGGCTCTGTACAACCAGCACAA 23 0 

III II I Ml MUM I I II III II I II llllll I II 

Db 1651 ATACCTCCGCAGCGC^GA^OUICCCATAGCACGGTGCTTGGACTATACAACACCCTGAA 1710 

Qy 231 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 290 

Ml I Mill II MM MMMM II MM MMMM Ml Mill I 

Db 1711 CCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTT 1770 

Qy 291 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 3 50 

MUM llllll I I IMMIIIIIIIIIIIIIIIIIIIIII I III II II 
Db 1771 GTACTATGTGGGCAGAACCCCCAAGGTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTG 183 0 



Qy 



351 CAAGTG CAG CTGA 3 63 



Db 




RESULT 14 
US-09-957-458B-9 

; Sequence 9, Application US/09957458B 
; Publication No. US20030166271A1 
; GENERAL INFORMATION: 
; APPLICANT: Chen, Una 

TITLE OF INVENTION: Method for growing stem cells 
; FILE REFERENCE: P66567US0 

; CURRENT APPLICATION NUMBER: US/09/957 , 458B 

; CURRENT FILING DATE: 2001-09-21 

; PRIOR APPLICATION NUMBER: PCT/EPOO/08247 

; PRIOR FILING DATE: 2000-08-24 

; PRIOR APPLICATION NUMBER: EP 99116533 

; PRIOR FILING DATE: 1999-08-24 

; NUMBER OF SEQ ID NOS : 10 

; SOFTWARE: Patent In version 3.2 

; SEQ ID NO 9 

LENGTH: 4382 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Vector for transforming supporting cell with a foreign to 
express 

; OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 51.8%; Score 188.2; DB 12; Length 4382; 

Best Local Similarity 75.1%; Pred. No. 5.9e-46; 

Matches 235; Conservative 0; Mismatches 78; Indels 0; Gaps 0; 
Qy 51 CGAGAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 110 



Db 




Qy 



111 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 170 



Db 




Qy 



171 CTACATCTGGAGCCTAGACACT(^GTA(^CAAGGTCCTGGCTCTGTAC^CCAGCACAA 230 



Db 



1504 ATACCTCCGCAGCGCAGACACAACCCATAGCACGGTGCTTGGACTATACAACACCCTGAA 




Qy 



231 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 2 90 



Db 




Qy 



2 91 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 35 0 



Db 




Qy 

Db 



3 51 CAAGTG CAG CTGA 363 

IIIIIIIIMII 

1684 TAAGTG CAG CTGA 1696 



RESULT 15 
US-09-813-271B-9 

; Sequence 9, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE : No. US20020115834Alart is Patent Department 

STREET: 564 Morris Avenue 

CITY : Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/0271 9 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4 -2 003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.TGF-beta2 (44/45) beta3 
FEATURE : 

NAME/KEY: mat_peptide 

LOCATION: 1..132 

OTHER INFORMATION: /product= "N- terminal 44 amino 
acids of human TGF-beta2" 
FEATURE : 

NAME/KEY: mat_peptide 
LOCATION: 133 . .336 



OTHER INFORMATION: /product= "C- terminal 68 amino 
acids of human TGF-beta3 ,f 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product^ "hybrid TGF-beta2-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
US-09-813-271B-9 

Query Match 46.7%; Score 169.6; DB 10; Length 336; 

Best Local Similarity 72.4%; Pred. No. l.le-40; 

Matches 22 0; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 
Qy 57 GGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAA 116 

I II I II llllll I II I II MINIM III II III II II Mill 

Db 33 GCAGGATAATTGCTGCCTACGTCCACTTTACATTGATTTCAAGAGGGATCTAGGGTGGAA 92 

Qy 117 GTGGATT CATGAACCCAAGGG CTAC CATG C CAATTT CTG CCTGGGG C CCTGT CC CTA CAT 176 

Mill II llllllll II III Mill MUM II II II II III I 

Db 93 ATGGATACACGAACCCAAAGGGTACAATGCCAACTTCTGCTCAGGCCCTTGCCCATACCT 152 

Qy 177 CTGGAGCCTAGA(^CTCAGTAC^C^GGTCCTGGCTCTGTA(^CCAGCACAACCCGGG 236 

I I II llllll III M III MM 1 1 ! I r 1 1 1 1 I Mill I 

Db 153 CCGGkGTGCAGACACAACCCACAGCAC^ 212 

Qy 237 CGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTA 2 96 

M II II MM MMIIIMM MM llllllll Ml Mill 

Db 213 AGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCTGTACTA 272 

Qy 2 97 CGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTG 356 

MM II Mill 1 1 1 E 1 1 1 1 1 f I llllll 1 1 1 I III II II II II 

Db 2 73 TGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAA.CATGGTGGTGAAGTCTTGTAAATG 332 

Qy 357 CAGC 360 

III 

Db 333 TAGC 336 



Search completed: October 28, 2003, 09:04:19 
Job time : 113.037 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 900.804 Seconds 

(without alignments) 
9794.056 Million cell updates/sec 

Title: US-10-017-372E-14 
Perfect score: 363 

Sequence: 1 gccctggataccaactactg gttcctgcaagtgcagctga 363 



Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : EST:* 

1: em_estba:* 

2: em_esthum:* 

3: em_estin:* 

4: em_estmu:* 

5: em_estov:* 

6: em_estpl:* 

7: em_estro:* 

8: em_htc:* 

9: gb_estl:* 
10: gb_est2:* 
11: gb_htc:* 
12: gb_est3:* 
13: gb_est4:* 
14: gb_est5:* 
15: em_estfun:* 
16: em_estom:* 
17: em_gss_hum:* 
18 : em_gss_inv : * 
19: em_gss_pln:* 
20: em_gss_vrt:* 
21: em_gss_fun:* 
22: em_gss_mam:* 
23: em_gss_mus:* 
24: em_gss_pro:* 
25: em_gss__rod : * 
26: em_gss_phg:* 
27: em_gss_vrl:* 
28: gb_gssl:* 
29: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



NO. 


Score 


Match Length 


DB 


ID 


Description 


c 1 


279.8 


77 . 1 


546 


9 


AI760533 


AI760533 wh88b09.x 


C 2 


279.8 


77. 1 


954 


13 


BX392116 


BX392116 BX392116 


3 


279.6 


77 . 0 


396 


12 


BI868266 


BI868266 603392221 


4 


279.6 


77 . 0 


400 


12 


BI820759 


BI820759 603034355 


5 


279.6 


77. 0 


431 


10 


BG180040 


BG180040 602329596 





r 
D 


279 . 


6 


77 , 




A (Z 1 

461 


1 0 
iz 


D M ~7 /I nCT7 

bM / 4 Ubo / 


BM74 053 7 


1/ r"T< A A 7 7 

K- hSTU U 1 1 


c 


7 


279 . 


6 


77 . 


. 0 


4 75 


13 


bUbozy4 1 


BU632941 


UI-H-DF0- 


c 


o 

8 


a i a 

279 . 


6 


77 . 


, 0 


/I O A 

4 0 U 


A 

y 


ai U8 y y U4 


AI089904 ( 


qal6a08 .x 




9 


279 . 


6 


77 


. 0 


r a r 

505 


10 


AW 9 58 056 


AW958056 


EST370126 




1 0 


279 . 


6 


77 


. 0 


536 


13 


BQzZZZ 05 


BQ222205 


AG EN COURT 


c 


1 1 


z / y . 


r 
O 


""7 "I 




C "3 O 
DO O 


1 z 


DM/; -7 A /I A A 

bMb / U4 Uz 


BM670402 


UI-E-DW1- 




1 a 

IZ 


z / y . 


b 


-7 -7 


A 


C /I A 

b4 U 


IZ 


Di y O 4 /OZ 


BI834732 


603090326 


c 


1 J 


z / y . 


b 


/ / . 


. u 


con 
bo / 


Iz 


byu 1 bUZ / 


BQ016027 


UI-H-DT1- 


c 


1 A 

14 


z /y . 


b 


/ / . 


A 


cr q "3 

byj 


IZ 


O A A O 7 "3/^ 

byu Uo lo 6 


BQ003136 


UI-H-EI1- 


c 


1 a 

1 b 


O "7 Q 

z / y . 


b 




A 


C Q Q 

b y 0 


Q 

y 


ATQCT7 Q 7 1 

Al y b loo 1 


AI951831 wx38b08.x 




1 C 


z / y . 


b 


"7 "7 

/ / , 


, u 


CQQ 

b y y 


1 o 
1Z 


bi yubybo 


oiyUDy DO 


cm n co ft a q 

DUOUt3Zo4i7 


c 


1 / 


z / y . 


b 


1 1 . 




a a a 
bUU 


1 A 
1 U 


br /z b y y b 


DDI T^QQC 


V^xrl C/-^A'3 \r 

vy 1 bcuo . y 


c 


1 Q 


z /y . 


b 


/ / . 




bib 


IZ 


DMiT 7 / A O ~7 

bNb /4 y 0 / 


PMC 7 4 QQ'7 

di v id / 4 y O / 


T TT f D T A 

U 1 - h - CiJ U - 


c 


1 A 

l y 


27 9 . 


b 


77 . 


. 0 


62 5 


14 


CA4 2 6732 


L_R4 ZO / 


u l - n - r hi 1 - 


c 


z U 


z / y . 


b 


/7 . 


A 


a a q 

64 y 


14 


CJA4Zo5o 9 


r*7A A O 'J c: O Q 
LA4 Z 0 DO y 


TTT TJ CT? 1 
Ul -hi- rhl - 


c 


Z X 


O 1 Q 

z / y . 


r 

b 


H "7 


A 

, (J 


bb4 


lz 


BM68 1/5 0 


bivib 0 1 / b U 


TTT V T7 T A 


c 


z Z 


z / y . 


b 


1 1 . 


A 


bbZ 


lo 


byb / bZZ b 


rJ^D / b Z Z D 


TTT U 1 

ul - M- hiZi 1 - 


c 


Z J 


z / y . 


b 


"7 H 


A 


an a 
b /4 


1 A 

1 4 


f*A C~ A "3 A O C 

LAb Uo U 0 b 


PA c: A "3 A Q C 


TTT HZ? TJMfi 


c 


O /I 

Z4 


z7 y . 


b 


77 . 


A 


676 


13 


BU73 754 5 


DI T7 3*7C/] C 

bU / 0 / D4 b 


TTT I? P^TaT 1 

U 1 - hi - UW 1 - 


c 


Z D 


z / y . 


b 


1 1 . 


A 
U 


bit 


7 /I 

14 


LA4 0 Zb8 1 


r"7A A 3 7CO 1 
UA4 OZ DO 1 


TTT U CT 1 

Ul -ri-r 1j1 - 


c 


o C 
Z b 


z / y . 


b 


/ / . 


A 

u 


b /o 


IZ 


□ Am 7 CZ ~3 1 

byuz 1 bo / 


ra^UZ 1 DO / 


T TT _u nui 


c 


Z / 


z / y . 


b 


"7 "7 


A 

u 


by u 


lo 


QTT£TOC:'3'7'7 

bUbZ bo / / 


bUbZ bo / / 


TTT U DPI 

U 1 -H- r <jl - 


c 


Z o 


z / y . 


b 


"7 "7 


A 

. u 


b y b 


7 /I 

14 


LA4 Z b / / b 


CA425775 


UI-H-FE1- 


c 


z y 


z / y . 


b 


"7 "7 

/ / . 


A 


by / 


13 


bUbo 0 bo b 


BU633535 


UI-H-FL1- 


c 


i a 
0 U 


z / y . 


b 


1 *7 

/ / , 


A 


by / 


1 A 

14 


CJA4 Z 63 9 1 


CA426391 


UI-H-FE1- 


c 


O 7 
3 1 


z7 y . 


b 


/ / . 


A 


—7 A A 

/uo 


13 


ni t r~ T T nil 

BU633 922 


BU633922 


UI-H-FL1- 


c 


"5 O 
OZ 


ATA 

z / y . 


b 


I 1 . 


A 


/zz 


lo 


BU63Z686 


BU632686 


UI-H-FE1- 


c 


"3 *3 


z / y . 


b 


/ / . 


A 


/b / 


7 A 
14 


A "3 AQ"70 7 

cjao u y /o 1 


CA309731 


UI-H-FT1- 


c 




279. 


6 


77 . 


A 


0 0 0 

888 


13 


BX4 3 44 2 5 


BX434425 


BX434425 


c 


J b 


279 . 


6 


"7 1 


A 


y z 8 


13 


Ba4 1z4o4 


BX412434 


BX412434 


c 


1 c 
o b 


279 . 


6 


*7 "7 


A 


y b b 


1 0 
IZ 


DTAO/1*77 D 
blU84 /18 


BI084718 


602869722 




"3 "7 
J / 


279 . 


6 


*7 T 
/ / . 


A 


7 1 O 7 

1181 


12 


t3h/fO A C A "~7 "I 

BM8 0 5377 


BM805377 


AGENCOURT 




38 


278 . 


6 


76. 


7 


427 


12 


BM738551 


BM738551 


K-EST0006 




39 


278 . 


6 


76 . 


7 


451 


12 


BM717493 


BM717493 


UI-E-EJ0- 




40 


278 . 


6 


76. 


7 


1093 


12 


BQ054305 


BQ054305 


AGENCOURT 


c 


41 


278 . 


6 


76. 


. 7 


1201 


13 


BX355682 


BX355682 


BX355682 


c 


42 


278 


76. 


6 


474 


10 


BE645704 


BE645704 


7e74hl0.x 


c 


43 


278 


76, 


6 


583 


13 


BU625902 


BU625902 


UI-H-FG1- 


c 


44 


278 


76. 


6 


655 


13 


BU624264 


BU624264 


UI-H-FG1- 


c 


45 


278 


76. 


6 


659 


14 


CA423193 


CA423193 


UI-H-FE1- 



ALIGNMENTS 



RESULT 1 

AI760533/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AI760533 546 bp mRNA linear EST 20-DEC-1999 

wh88b09.xl NCI_CGAP_CLL1 Homo sapiens cDNA clone IMAGE : 2387801 3' 
similar to gb: X02812_cdsl TRANSFORMING GROWTH FACTOR BETA 1 
PRECURSOR (HUMAN) /contains element TAR1 repetitive element ;, mRNA 
sequence. 
AI760533 

AI760533 .1 GI : 5176200 
EST . 

Homo sapiens (human) 
Homo sapiens 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Eukaryota; Metazoa; Chordata ; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 546) 

NCI-CGAP http: //www.ncbi .nlm.nih.gov/ncicgap. 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 
Ema i 1 : cgapbs - r@ma i 1 . n ih . gov 

Tissue Procurement: Ash Alizadeh, John Byrd, M.D., Mike Grever, 
M.D., Louis M. Staudt, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
www-bio. llnl . gov/ bbrp/ image/ image.html 
Insert Length: 714 Std Error: 0.00 
Seq primer: -40UP from Gibco 
High quality sequence stop: 465. 
Location/Qualifiers 
1. .546 

/organism= M Homo sapiens" 
/mol__type=" mRNA 11 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 238 78 01" 

/tissue__type="B-cell , chronic lymphotic leukemia" 

/lab_host="DH10B" 

/ c 1 one_l i b = " NC I _CGA P_CLL 1 " 

/note="Vector : pT7T3D-Pac (Pharmacia) with a modified 
polyl inker; Site_l: Not I; Site_2: Eco RI; 1st strand cDNA 
was primed with a Not I - oligo(dT) primer [5 1 
TGTTACCAATCTGAAGTGGGAGCGGCCGCATTGCTTTTTTTTTTTTTTTTTTTTTTTT 
T 3']; double-stranded cDNA was ligated to Eco RI 
adaptors (Pharmacia) , digested with Not I and cloned into 
the Not I and Eco RI sites of the modified pT7T3 vector. 
Library is normalized, and was constructed by Bento 
Soares and M.Fatima Bona 1 do." 
86 a 157 c 204 g 98 t 1 others 



Query Match 77.1%; Score 279.8; DB 9; 

Best Local Similarity 89.1%; Pred. No. 1.5e-60; 
Matches 302; Conservative 0; Mismatches 37; 



Length 54 6; 

Indels 0; Gaps 0; 



Qy 
Db 



2 5 AGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTC 84 

Ml I II II I I I II MM Ml III II II, MINIM III I 

4 76 AGCCCTGGACACAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 417 



Qy 



Db 



85 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 144 

MMMMMMM MINIM 1 1 1 1 1 1 1 II 1 1 1 1 1 II I! 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

416 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 357 



Qy 



Db 



14 5 GC CAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAACAAG 204 

Mill llllllll MINIM MINIM 1 1 1 1 1 1 1 1 II II I MINN 1 1 1 1 

3 56 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 297 



Qy 205 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 264 

MINIM f J 1 1 1 1 f 1 1 1 1 1 1 1 MIMIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 2 96 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 237 

Qy 265 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 324 

1 1 r j i r 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 j 1 1 1 1 1 [ i f 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 

Db 236 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 177 

Qy 325 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 363 

IIMMMMMIIMIMI MMMMMMMMM 

Db 176 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 13 8 



RESULT 2 

BX392116/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BX392116 954 bp mRNA linear EST 13-MAY-2003 

BX392116 Homo sapiens NEUROBLASTOMA COT 25 -NORMALIZED Homo sapiens 
cDNA clone CS0DC005YF22 3 -PRIME , mRNA sequence. 
BX392116 

BX392116.1 GI :30611808 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 954) 
Li,W.B. , Gruber,C. , 



FEATURES 

source 



Craniata ; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 

and Polayes,D. 



BASE COUNT 
ORIGIN 



Jessee, J. 

Full-length cDNA libraries and normalization 

Unpublished 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref ©genoscope . ens . f r , Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / /www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0BAK013CE06NMl&cluster=9160 . r. Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAK013CE06NM1 . 

Location/Qualif lers 

1. .954 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref ="taxon: 9606" 
/clone=" CS0DC005YF22" 

/tissue^ type=" NEUROBLASTOMA COT 25 -NORMALIZED" 
/clone_iib="Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
187 a 258 c 307 g 202 t 



Query Match 



77.1%; Score 279.8; DB 13; Length 954; 



Best Local Similarity 93.0%; Pred. No. 1.9e-60; 

Matches 293; Conservative 0; Mismatches 22; Indels 0; Gaps 0; 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



4 9 GACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTG 108 

II II ! . I . ' I 1 I I I I I I I I i ' I , I ' I llllllllllllll llilllll 

4 51 GATTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTC 392 

109 GGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGT 168 

llllllllllllll II 1 1 llllllllllllllllllli MINIM llilllll 

391 GGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGC 332 
169 CCCTACATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCAC 228 

llilllll llilllll Mill I ; 1 1 1 1 1 M Ml MINIMUM 

331 CCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCAT 272 
229 AACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATC 288 

Mlllllllll 1 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 1 : 1 ! 1 ' 1 , 1 1 1 1 1 1 1 , ! 1 1 1 1 Ml, MM 

271 AACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATC 212 
28 9 GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCC 348 

IMIIIIIIMMMMMIMIIIIIIIIIIIMMIIIIIIIIMIIIIIIIII III 

211 GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCC 152 
34 9 TG CAAGTG CAG CTGA 363 

1 1 1 1 1 1 1 1 1 M M 1 1 

151 TG CAAGTG CAG CTGA 137 



RESULT 3 
BI868266 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BI868266 396 bp mRNA linear EST ll-OCT-2001 

603392221F1 NIH_MGC_90 Homo sapiens cDNA clone IMAGE : 5402212 5\ 
mRNA sequence. 
BI868266 

BI868266 .1 GI : 16041939 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 396) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Ema i 1 : cgapbs - r@ma i 1 . n ih . gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLAM12025 row: j column: 05 
High quality sequence stop: 383. 

Loca t i on/ Qua 1 i f i er s 

1. .396 



BASE COUNT 
ORIGIN 



/organism= M Homo sapiens" 

/mol__type="mRNA" 

/ db_x r e f = " t a xon : 9 6 0 6 " 

/clones" IMAGE: 5402212" 

/tissue__type= "adenocarcinoma , cell line" 
/lab__host = "DH10B (phage-resistant) " 
/clone_lib="NIH_MGC_90" 

/note="Organ: liver; Vector: pCMV-SPORT6; Site_l: NotI; 
Site_2 : Sail; Cloned unidirectionally; oligo-dT primed. 
Average insert size 1.7 kb. Library enriched for 
full-length clones and constructed by Life Technologies. 
Note: this is a NIHJYIGC Library." 
75 a 135 c 121 g 65 t 



Query Match 77.0%; Score 279.6; DB 12; Length 396; 

Best Local Similarity 93.9%; Pred. No. 1.5e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



54 CAAGGAGAAG AACTG CTG CGTG CGG CAG CT CTA CATTGACTT C CGGAAGG A C CTGGG CTG 113 

II MIIIIIIMIIIIIIIIIIIIIIIII llllllllllllll llllllll Mill 

7 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 12 9 

114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI M II 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 II I II M I M I llllllll Mill 

13 0 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 18 9 

174 CATCTGGAGCCTAGACACTCAGTACAACA^^ 233 

Ml ,' 1 1 1 1 1 Mill 1 1 1 1 1 1 1 MIMMIMM 1 1 1 1 1 1 M 1 1 M I i Mill 

19 0 CATTTGGAGCCTGGAC^CGC^GTAC^GCAAGGTCCTGGCCCTGTAC^ACCAGCATAACCC 24 9 

234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

MINI IIMMMIIMIMMMIIIIIMIIIMMIMM MIMMIMMM 

2 5 0 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 3 0 9 



Qy 
Db 

Qy 

Db 



2 94 CTACGTGGGCCGCAAGCC(^GGTGGAGC^GCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

IIIMIIIIIIIIIMIMIIMIIIIMIMMIIIIIMIMIIIIMI llllllll 

310 CTACGTGGGCCG(^GCCC^GGTGGAGCAGCTGTC(^(^TGATCGTGCGCTCCTGC^A 369 

3 54 GTGCAGCTGA 3 63 

Illlllllll 
3 70 GTGCAGCTGA 379 



RESULT 4 
BI820759 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 



BI820759 400 
603034355F1 NIH_MGC 115 Homo 
mRNA sequence. 
BI820759 

BI 82 0759. 1 GI : 15 932309 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 400) 



bp mRNA 
sapiens cDNA 



linear EST 04-OCT-2001 
Clone IMAGE: 5175732 5 ' ; 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



AUTHORS 
TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLAM11437 row: m column: 13 
High quality sequence stop: 364. 

Locat ion/Qual i f iers 

1. .400 

/organism="Homo sapiens" 

/ mo 1 _ t yp e = " mRN A " 

/db_xref="taxon: 96 06" 

/clone=" IMAGE: 5175732" 

/labJiost="DH10B" 

/ c 1 one_l ib= " NI H_MGC_1 1 5 " 

/note="0rgan: pooled brain, lung, testis; Vector: 
pCMV-SP0RT6; Site_l : Not I ; Site_2: EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-27; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and directionally cloned (EcoRV site is 
destroyed upon cloning). Average insert size 1.8 kb, 
insert size range 1-3 kb. Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 
021. Note: this is a NIH_MGC Library." 
73 a 131 c 131 g 65 t 



Query Match 77.0%; Score 279.6; DB 12; Length 400; 

Best Local Similarity 93.9%; Pred. No. 1.5e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1'l II MINIMUM MINIM Mill 

4 9 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 108 
114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 M M II II M M II 1 1 1 Ml II M 1 1 M I ! 1 1 1 1 1 1 IIIMIII Mill 

109 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 168 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTA 233 

Ml Mill! IMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMIIIIIIMI IMM 

169 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 22 8 
234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

M II 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 Ml I M I ! 1 1 1 1 

22 9 GGGCG C CTCGG CGG CG CCGTG CTG CGTG CCG CAGGCG CTGGAG C CG CTG C C CAT CGTGTA 28 8 
2 94 CTACGTGGGCCG(^GCCCAAGGTGGAGCA.GCTGTCC^CATGATCGTGCGTTCCTGCAA 353 

1 1 1 II 1 1 M M I M 1 1 1 M M 1 1 1 1 M 1 1 1 1 M M II II M 1 1 M I M II I MM M M 



Db 



28 9 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 34 8 



Qy 

Db 



3 54 GTGCAGCTGA 363 

II II M II 1 1 

34 9 GTGCAGCTGA 358 



RESULT 5 
BG180040 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BG180040 431 bp mRNA linear EST 06-FEB-2001 

602329596F1 NIH_MGC_91 Homo sapiens cDNA clone IMAGE : 443 12 14 5', 
mRNA sequence. 
BG180040 

BG180040. 1 GI : 12686743 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 431) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: DCTD/DTP 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : // image . llnl .gov 
Plate: LLAM10185 row: o column: 23 
High quality sequence stop: 338. 

Location/Qualifiers 

1. .431 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone="IMAGE:4431214" 

/tissue_type="adenocarcinoma, cell line" 
/lab_host="DH10B (phage-res istant ) " 
/ c 1 one_l ib= " NI H_MGC_9 1 " 

/note="0rgan: prostate; Vector: pCMV-SP0RT6; Site_l : NotI; 
Site_2: Sail; Cloned unidirectionally ; oligo-dT primed. 
Average insert size 1.4 kb. Library enriched for 
full-length clones and constructed by Life Technologies. 
Note: this is a NIH MGC Library." 
95 a 126 c 148 g 62 t 



Query Match 77.0%; Score 279.6; DB 10; Length 431; 

Best Local Similarity 93.9%; Pred. No. 1.6e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 



54 CAAGGAGAAGAA CTG CTG CGTG CGG CAG CTCTA CATTGACTT CCGGAAGGACCTGGG CTG 113 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 1 ! 1 1 f 1 1 II Mi I MINIM Mill 



Db 



28 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 87 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MINIMI 1 1 1 1 IMIIIIIIIMIIIIIIII MMMII MINIM Mill 

88 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 7 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

Ml MMMII INN lllllll MINIMUM IIIIIIIIIIIIII I II 1 1 

14 8 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 2 07 



234 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

INIM IIIINNIIIINIIIIIIIIIIIIIIIIIIIINII NIMIMMIMI 

2 08 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 267 
2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

1 1 M 1 1 1 1 II I II I M 1 1 M M 1 1 1 1 1 1 1 1 1 1 M M M II II M 1 1 M II I llllllll 

2 68 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 32 7 



354 



363 



GTGCAGCTGA 

INIINIM 

32 8 GTGCAGCTGA 337 



RESULT 6 
BM740537 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



BM740537 



FEATURES 

source 



461 bp 



mRNA 



linear EST 01-MAR-2002 



Y. , Kim, M . R . , 
Kim,S. and 



K-EST0011722 S1SNU5 Homo sapiens cDNA clone S1SNU5-5-B09 5', mRNA 

sequence . 

BM740537 

BM74 0537. 1 GI : 190618 66 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 461) 

Kim,N.S., Hahn,Y., Oh,J.H., Lee,J.Y., Ahn,H.Y., Chu,M 
Oh,K.J., Cheong,J.E., Sohn,H.Y., Kim,J.M., Park,H.S., 
Kim, Y.S. 

21C Frontier Korean EST Project 2001 
Unpublished 
Contact : Kim YS 
Genome Research Center 

Korea Research Institute of Bioscience & Biotechnology 
52 Eoeun-dong Yuseong-gu, Daejeon 305-333, South Korea 
Tel : +82-42-860-4470 
Fax: +82-42-860-4409 
Email : yongsungSmail . kribb. re.kr 
Plate: 5 row: B column: 09 
High quality sequence stop: 461. 

Location/Qualifiers 

1. .461 

/organism= M Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="SlSNU5-5-B09" 
/sex="F" 



BASE COUNT 
ORIGIN 



/tissue_type= "Ascites" 

/eel l_type= " Lymphobl as t -like" 

/cell__line="SNU-5" 

/lab_host="ToplOF ' " 

/clone_lib="SlSNU5" 

/note= "Organ : Stomach; Vector: pCNS; Site_l: EcoRI; 
Site_2: Not I; The poly (A) + RNA was dephosphorylated with 
bacterial alkaline phosphatase (BAP) and then decapped 
with tabacco acid pyrophosphatase (TAP) . The decapped 
intact mRNA was ligated with DNA-RNA linker including EcoR 
I site by treatment of T4 RNA ligase and the first strand 
cDNA was synthesized from oligo dT-selected mRNA by 
priming with dT-tailed vector. The dT- tailed vector was 
adjusted to have about 60nt. The cDNA vector was 
circularized with E. coli DNA ligase after digestion of 
EcoRI which site is also included in vector. An RNA strand 
converted to a DNA strand by Okayama-Berg method. The 
obtained cDNA vectors were used for transformation of 
competent cells E. coli ToplOF 1 by electroporat ion method. 
The cDNA libraries constructed by this method are 
full-length enriched cDNA library." 
86 a 161 c 133 g 81 t 



Query Match 77.0%; Score 279.6; DB 12; Length 461; 

Best Local Similarity 93.9%; Pred. No. 1.6e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 


54 


Db 


148 


Qy 


114 


Db 


208 


Qy 


174 


Db 


268 


Qy 


234 


Db 


328 


Qy 


294 


Db 


388 


Qy 


354 


Db 


448 



II II III III jlllllMIMMIIMI MINN INI MINI! Mill 



MINIM II II 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 Mm 1 1 1 1 1 1 1 1 Mill 



Ml MINIM INN II i IMMIMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 



MM 



1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 



IMMIIIIIIIIM 



INN 



lllllll IIMMINN IMIIIIIIIII MINIM 



RESULT 7 
BU632941/C 

LOCUS BU632941 475 bp mRNA linear EST 23-SEP-2002 

DEFINITION UI-H-DF0-bel-c-18-0-UI.sl NCI_CGAP_DF0 Homo sapiens cDNA clone 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



UI-H-DF0-bel-c-18-0-UI 3', mRNA sequence. 
BU632941 

BU632 941 . 1 GI : 2330 0196 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates / Catarrhini; Hominidae; Homo. 
1 (bases 1 to 475) 

NCI-CGAP http: //www.ncbi .nlm.nih.gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. Jose Mercuende 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be obtained 

from Dr. M. Bento Soares, bento-soares@uiowa.edu 
The following repetitive elements were found in this cDNA 

sequence : 97-152 , >GC_rich#Low_complexity 

Seq primer: M13 FORWARD 

POLYA=Yes . 

Loca t ion/ Qua 1 i f iers 
1. .475 

/organism="Homo sapiens" 
/mol__type=" mRNA " 
/db_xref= M taxon: 9606" 
/clone="UI -H-DF0-bel-c-18-0-UI" 
/tissue_type=" Subchondral Bone" 
/dev_stage= "Adult" 

/lab_host="DH10B (Life Technologies) " 
/clone_lib="NCI_CGAP_DFO" 

/note="0rgan: Bone; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_l: EcoR I; Site_2: Not I; 
NCI_CGAP_DF0 is a cDNA library containing the following 
tissue(s): Subchondral Bone. The library was constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned directionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first -strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT) 18 tail. The 
sequence tag for this library is GTTAAGCGTC. 
TAG__LIB=UI-H-DF0 
TAG_TISSUE=subchondral bone 
TAG_SEQ=GTTAAGCGTC " 
70 a 136 c 171 g 98 t 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 13; Length 475; 
Pred. No. 1.6e-60; 
0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 


54 


Db 


465 


Qy 


114 


Db 


405 


Qy 


174 


Db 


345 


Qy 


234 


Db 


285 


Qy 


294 


Db 


225 


Qy 


354 


Db 


165 



CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I II 1 1 1 Ml 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 MINIM Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 4 06 
GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II Ml III M 1 1 1 II I MINIM llllllll INN 

GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 34 6 



III llllllll Mill MINN llllllllllll IMMIMMIMI Mill 



WWW 1 1 1 II M 1 1 1 1 1 II I MM i I M 1 1 1 1 1 M II 1 1 1 1 II Mlllll MM 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 3 53 

1 1 1 M 1 1 M M 1 1 1 M i M M I M M M 1 1 M M M II 1 1 M MMM llllllll 

CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 16 6 



INIINII 



RESULT 8 

AI089904/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
COMMENT 



AI089904 480 bp mRNA linear EST 01-OCT-1998 

qal6a08.xl NCI_CGAP_Brn23 Homo sapiens cDNA clone IMAGE : 1686902 3' 
similar to gb: X02812_cdsl TRANSFORMING GROWTH FACTOR BETA 1 
PRECURSOR (HUMAN) /contains element MER22 repetitive element ;, mRNA 
sequence. 
AI089904 

AI089904 . 1 GI :3428963 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 480) 

NCI /NINDS-CGAP http : //www. ncbi . nlm . nih . gov/ncicgap . 
National Cancer Institute / National Institute of Neurological 
Disorders and Stroke, Brain Tumor Genome Anatomy Project 
(CGAP/BTGAP) , Tumor Gene Index 
Unpubl ished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih .gov 

Tissue Procurement: David N. Louis, M.D., Myrna R. Rosenfeld M.D., 
Ph . D . 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 
cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

www-bio . llnl . gov/bbrp/ image/ image . html 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Insert Length: 895 Std Error: 0.00 
Seq primer: -4 0ml3 f wd . ET from Amersham 
High quality sequence stop: 414. 

Location/Qualifiers 

1. .480 

/organism="Homo sapiens" 

/ mo 1 _ t ype = " mRNA " 

/db__xref="taxon: 9606" 

/clone="IMAGE: 1686902" 

/tissue_type="glioblastoma (pooled) " 

/lab_host= M DH10B" 

/clone_lib="NCI_CGAP_Brn23" 

/note="Organ: brain; Vector: pT7T3D-Pac (Pharmacia) with a 
modified polylinker; Site_l : Not I; Site_2: Eco RI ; 1st 
strand cDNA was primed with a Not I - oligo(dT) primer [5' 
TGTTACCAATCTGAAGTGGGAGCGGCCGCATATCTTTTTTTTTTTTTTTTTTTTTTTT 
T 3*]; double- stranded cDNA was ligated to Eco RI 
adaptors (Pharmacia) , digested with Not I and cloned into 
the Not I and Eco RI sites of the modified pT7T3 vector. 
Library is normalized, and was constructed by Bento 
Soares and M.Fatima Bonaldo." 
74 a 140 c 179 g 87 t 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 9; Length 480; 
Pred. No. 1.6e-60; 
0; Mismatches 19; Indels 0; 



Gaps 



0; 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II IIIIIIIIIIIIIIIIIIIIIIIIIM llllllllllllll llllllll Mill 
4 48 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 38 9 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

I I It I M It II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM MMMM Mill 

3 88 GAAGTGGATCC^CGAGCCC^AGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 32 9 

174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCT 233 

III llllllll Mill IIIIIM llllllllllll MIMMMMIM Mill 

3 28 C^TTTGGAGCCTGGACACGC^GTAC^GCAAGGTCCTGGCCCTGTAC^CCAGCATAACCC 2 69 



234 



268 



294 



208 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

MUM I I I I , I I I , I I I I I I . I I I I I I I I I I I I I I I ' I ' I I I llllllllllllll 
GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 



293 



209 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

I ' 1 1 1 1 1 1 1 1 1 1 ; 1 1 : I ' 1 1 1 1 : 1 1 1 : 1 ; ' 1 1 ! 1 1 1 1 1 1 MMMM 

CTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCa^C^TGATCGTGCGCTCCTGC^A 14 9 



3 54 GTGCAGCTGA 363 

Minimi 

14 8 GTGCAGCTGA 13 9 



RESULT 9 
AW958056 

LOCUS AW958056 505 bp mRNA linear EST 01-JUN-2000 

DEFINITION EST370126 MAGE resequences, MAGE Homo sapiens cDNA, mRNA sequence. 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



, Dharap,S., Gaspard,R., Gay,C. 
Lee , N . H . , Yeatman , T . J . and 



Holt 



BASE COUNT 
ORIGIN 



AW958056 

AW958056. 1 GI :8147739 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 505) 
Hegde,P., Qi,R., Abernathy,K 
, I.E., Saeed,A.l., Sharov,V. 
Quackenbush, J . 

Assessment of gene expression patterns in a model of colon tumor 

metastasis using a 19,200 element cDNA microarray 

Unpublished 

Contact : John Quackenbush 

The Institute for Genomic Research 

9712 Medical Center Dr., Rockville, MD 20850, USA 

Tel: 301 838 3528 

Fax: 301 838 0208 

Ema i 1 : j ohnq@ t igr . org 

Plate: 115 

Seq primer: Reverse. 

Loca t ion/ Qual i f iers 
1. .505 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone_lib="MAGE resequences, MAGE" 
/note= "Vector : pBluescriptSKm" 
115 a 137 c 106 g 144 t 3 others 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 10; 
Pred. No. 1.7e-60; 
0; Mismatches 19; 



Length 505; 
Indels 0; 



Gaps 



0; 



Qy 
Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II IIIMMMMIMMIIIIIIIMM 1 1 1 1, 1 1 1 1 II 1 1 ! Illlllll MM! 

38 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 97 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



114 



98 



174 



GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Illlllll MINIM Mill 

GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 157 



CATCTGGAGCCTAGACACTCAGTACAACAAGGTC 233 

Ml Illlllll Mill lllllll 1 1 1 1 1 r 1 1 1 1 1 1 IMIIIMIIIIII Mill 

158 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 217 



234 



218 



294 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 IMIIIMIIIIII 

GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 77 



353 



CTA CGTGGG C CG CAAG C C CAAGGTGGAG CAG CTGTC CAA CATGAT CGTG CGTTCCTG CAA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Illlllll 

278 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCC^ACATGATCGTGCGCTCCTGCAA 3 37 



Qy 



3 54 GTGCAGCTGA 3 63 



IMIIIIIII 

Db 33 8 GTGCAGCTGA 34 7 



RESULT 10 

BQ222205 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BQ222205 536 bp mRNA linear EST 02-MAY-2002 

AGENCOURT_72 58595 NIH_MGC_71 Homo sapiens cDNA clone IMAGE : 5786680 
5 * , mRNA sequence. 
BQ222205 

BQ2222 05 .1 GI : 204 035 94 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 (bases 1 to 536) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl .gov 

Plate: LLAM12875 row: m column: 17 

High quality sequence stop: 535. 
Location/Qualifiers 
1. .536 

/ organ ism=" Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref = M taxon: 9606" 
/ clone=" IMAGE : 578 668 0" 
/tissue_type=" leiomyosarcoma " 
/lab_host="DH10B (phage-resistant) " 
/ c 1 one_l i b= " NI H__MGC__7 1 " 

/note= "Organ: uterus; Vector: pCMV-SP0RT6; Site_l: Not I; 
Site_2: Sail; Cloned unidirectionally . Primer: Oligo dT. 
Average insert size 2.1 kb. " 
113 a 193 c 150 g 77 t 3 others 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 27 9.6; DB 13; Length 536; 
Pred. No. 1.7e-60; 
0; Mismatches 19; Indels 0; 



Gaps 



0, 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

I! '! I II M 1 1 1 1 II 1 1 II 1 , 1 1 1 II 1 ,1 llllllllllllll llllllll Mill 

65 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 124 



Qy 

Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

lllllllll II II IIIMIMIIIIIIIII III llllllll llllllll Mill 

125 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 184 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



174 CATCTGGAGCCTAGACACTCAGTAO^CAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

III MINIM IMM IMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 Mill 

18 5 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 244 
234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 E I [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

24 5 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 3 04 



294 



353 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

M I ! 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 MINIM 

3 05 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 3 64 

3 54 GTGCAGCTGA 363 

IMIINI 
3 65 GTGCAGCTGA 374 



RESULT 11 

BM670402/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



BM670402 538 bp mRNA linear EST 27-FEB-2002 

UI-E-DWl-ahc-l-03-0-UI .si UI-E-DW1 Homo sapiens cDNA clone 
UI-E-DWl-ahc-l-03-0-UI 3', mRNA sequence. 
BM670402 

BM670402.1 GI:18980299 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 538) 

Bonaldo,M.F. , Lennon,G. and Soares, M.B. 

Normalization and subtraction: two approaches to facilitate gene 
discovery 

Genome Res. 6 (9), 791-806 (1996) 

97044477 

8889548 

Contact: Soares, MB 

Coordinated Laboratory for Computational Genomics 
University of Iowa 

375 Newton Road , 4156 MEBRF , Iowa City, IA 52242, USA 

Tel: 319 335 8250 

Fax: 319 335 9565 

Email : bento -soares@uiowa . edu 

Tissue Procurement: Dr. Gregg Hageman 
cDNA Library preparation: Dr. M. Bento Soares, Univeristy of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, Univeristy of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, Univeristy of Iowa 
Clone Distribution: Researchers may obtain clones from Research 

Genetics (www.resgen.com). 
The following repetitive elements were found in this cDNA 

sequence : 97-152 , >GC_rich#Low_complexity 

Seq primer: M13 Forward 

P0LYA=Yes . 

Location/Qual if iers 
1. .538 

/organism="Homo sapiens" 



BASE COUNT 
ORIGIN 



/ mo 1 _ t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/clone="UI-E-DWl-ahc-l-03-0-UI " 
/ 1 i s sue_t ype= " 1 ens " 
/dev_stage= "adult " 

/lab__host = "DH10B (Life Technologies) (Tl phage resistant) " 
/clone_lib= M UI-E~DWl" 

/note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a 
modified polyl inker; Site_l: EcoR I; Site_2 : Not I; 
UI-E-DW1 is a normalized cDNA library containing the 
following tissue (s) : lens. The library was constructed 
according to Bonaldo, Lennon and Soares , Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned direct ionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis of 
first -strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT) 18 tail. The 
sequence tag for this library is CGATTAGCGA. This library 
was created for the program, Gene Discovery in the Visual 
System, supported by National Eye Institute (NEI) . 
TAG_LIB=UI-E-DW1 
TAG_TI SSUE=human lens 
TAG_S E Q = CGATTAG CGA " 
78 a 153 c 195 g 112 t 



Query Match 77.0%; Score 279.6; DB 12; Length 538; 

Best Local Similarity 93.9%; Pred. No. 1.7e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; 



Gaps 



0; 



Qy 

Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II 1 II 1 1 1 1 1 1 1 II I II 1 1! 1 1 'I I Ml i IMIIIIMMIII MINIM Mill 

4 65 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 4 06 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MM: 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II IMIIIII MINIM Mill 

4 05 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 34 6 
174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

Ml 1 1 1 II 1 1 1 INN MUM 1 1 1 r I f I f 1 1 1 1 1 ' 1 1 i 1 1 1 1 1 1 1 1 1 INN 

34 5 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 28 6 
234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

MINI II 1 1 1 1 II I II I Ml I M I Ml M I M I II MIM II I 1 1 1 1 1 j 1 1 1 1 1 1 > I 

2 85 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 22 6 

2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II I N I II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 MMMM 

225 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 166 

3 54 GTGCAGCTGA 3 63 

I I I I I i f I I I 
165 GTGCAGCTGA 156 



RESULT 12 
BI834732 

LOCUS BI834732 540 bp mRNA linear EST 04-OCT-2001 

DEFINITION 603090326F1 NIH_MGC_120 Homo sapiens cDNA clone IMAGE : 5229246 5', 

mRNA sequence. 
ACCESSION BI834732 

VERSION BI834732.1 GI:15946269 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 540) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpublished 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih.gov 
Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http : //image . llnl . gov 
Plate: LLAM11577 row : c column: 07 
High quality sequence stop: 535. 
FEATURES Locat ion/Qual i f iers 

source 1. .54 0 

/organ i sm= "Homo sapiens " 
/mol_type=" mRNA" 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 522 9246" 
/lab_host="DH10B" 
/clone_lib="NIH_MGC_120" 

/note="0rgan: pooled pancreas and spleen; Vector: 
pCMV-SP0RT6 ; Site_l: Not I; Site_2 : EcoRV (destroyed); RNA 
source anonymous pool of spleen and pancreas from 28 yo 
male. Library is oligo-dT primed and directionally cloned 
(EcoRV site is destroyed upon cloning) . Average insert 
size 1.5 kb, insert size range 1-2.5 kb. Library is 
normalized and enriched for full-length clones and was 
constructed by C. Gruber (Invitrogen) . Research Genetics 
tracking code 025. Note: this is a NIH_MGC Library." 

BASE COUNT 104 a 188 c 154 g 94 t 

ORIGIN 



Query Match 77.0%; Score 27 9.6; DB 12; Length 540; 

Best Local Similarity 93.9%; Pred. No. 1.7e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II M 1 1 1 1 : 1 1 1 II 1 1 1 1 M 1 1 1 , 1 1 1 1 1 llllllllllllll IIIIMII Mill 

Db 22 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 27 9 

Qy 114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

MIIIIMI 1 1 II MMMIMMMIIIMM IIIIMII IIIIMII Mill 



Db 



28 0 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 33 9 



Qy 174 CATCTGGAGCCTAGACACTCAGTACAACAAGGTCCTGGCTCTGTACAACCAGCACAACCC 233 

III llllllll Mill MINI MINIMUM MINIMUM Mill 

Db 34 0 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 3 99 

Qy 234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

WWW IIIIIIIIIIIIIIIININIIIIIIIIIIIIIIIII NINNI 

Db 4 00 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 4 59 

Qy 2 94 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

I II 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II N 1 1 II 1 1 II II 1 1 1 1 1 1 1 II llllllll 

Db 4 60 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 519 

Qy 3 54 GTGCAGCTGA 3 63 

Illlllllll 
Db 52 0 GTGCAGCTGA 52 9 



RESULT 13 
BQ016027/C 

LOCUS BQ016027 587 bp mRNA linear EST 17-JUN-2002 

DEFINITION UI-H-DTl-avz-h-20-0-UI .si NCI_CGAP_DT1 Homo sapiens cDNA clone 

IMAGE: 5886403 3', mRNA sequence. 
ACCESSION BQ016027 

VERSION BQ016027.1 GI: 19751304 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 587) 

AUTHORS NCI-CGAP http : //www . ncbi . nlm . nih . gov/ncicgap . 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 
JOURNAL Unpubl ished 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. Jose Mercuende 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be found 

through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
The following repetitive elements were found in this cDNA 

sequence: 97-152 , >GC_rich#Low_complexity 

Seq primer: M13 FORWARD 

P0LYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .587 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/ clone= " IMAGE : 5886403" 

/t is sue_type= "Metastatic Chondrosarcoma" 
/dev_s tage= "Adult " 

/lab_host="DH10B (Life Technologies)" 



BASE COUNT 
ORIGIN 



/ c 1 one_l ib = " NCI _CGAP__DT1 " 

/not e=" Organ: Lung; Vector: pT7T3-Pac (Pharmacia) with a 
modified polyl inker; Site_l: EcoR I; Site_2 : Not I; 
NCI_CGAP_DT1 is a normalized cDNA library containing the 
following tissue(s): Metatastic Chondrosarcoma in Lung. 
The library was constructed according to Bonaldo, Lennon 
and Soares, Genome Research, 6:791-806, 1996. First strand 
cDNA synthesis was primed with an oligo-dT primer 
containing a Not I site. Double stranded cDNA was ligated 
to an EcoR I adaptor, digested with Not I, and cloned 
directionally into pT7T3-Pac vector. The oligonucleotide 
used to prime the synthesis of first-strand cDNA contains 
a library tag sequence that is located between the Not I 
site and the (dT) 18 tail. The sequence tag for this 
library is AACTGTTCGG. 
TAG_LIB=UI-H-DT1 

TAG_TISSUE=lung metatastic chondrosarcoma 
TAG SEQ=AACTGTTCGG" 
91 a 163 c 213 g 120 t 



Query Match 77.0%; 
Best Local Similarity 93.9%; 
Matches 291; Conservative 



Score 279.6; DB 12; Length 587; 
Pred. No. 1.8e-60; 
0; Mismatches 19; Indels 0; 



Gaps 



Qy 



Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II I IIIIMIIIIIIII MINIM Mill 

4 65 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 4 06 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 



114 



405 



174 



GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 173 

1 1 1 1 1 II M II II MIMIMMMMIMMI 1 1 1 1 1 1 1 r IMIIMI Mill 

GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 346 



CATCTGGAGCCTAGACACTC^GTA(^(^GGTCCTGGCTCTGTA(^CC^GC^C^ACCC 

Ml IMIIMI Mill lllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMM 

345 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 



233 



286 



234 



285 



294 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 2 93 

M I II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 22 6 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMIIMI 

22 5 CTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 166 
354 GTGCAGCTGA 363 

IMMMMI 

165 GTGCAGCTGA 156 



RESULT 14 

BQ003136/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 



BQ003136 593 bp mRNA linear EST 26-MAR-2002 

UI-H-EIl-ayx-d-19-0-UI . si NCI_CGAP_EI1 Homo sapiens cDNA clone 
IMAGE: 5844834 3', mRNA sequence. 
BQ003136 

BQ003136. 1 GI : 19728036 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata ; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 593) 

NCI-CGAP http: //www.ncbi .nlm.nih.gov/ncicgap. 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP) , 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail . nih .gov 

Tissue Procurement: Dr. Jose Mercuende 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be found 

through the I.M.A.G.E. Consort ium/LLNL at: http://imaqe.llnl.gov 
The following repetitive elements were found in this cDNA 

sequence: 97-152 , >GC_rich#Low_complexity 

Seq primer: M13 FORWARD 

POLYA=Yes . 

Loca t ion/ Qual i f iers 
1. .593 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref ="taxon: 9606" 

/clone=" IMAGE: 5844834" 

/ 1 i ssue_type = " Chondrosarcoma " 

/dev_stage="Adult" 

/lab_host="DH10B (Life Technologies) " 
/clone_lib="NCI_CGAP_EIl" 

/note="0rgan: Left Pelvis; Vector: pT7T3-Pac (Pharmacia) 
with a modified polylmker; Site_l: EcoR I; Site_2: Not I; 
NCI_CGAP_EI1 is a normalized cDNA library containing the 
following tissue(s): Chondrosarcoma. The library was 
constructed according to Bonaldo, Lennon and Soares, 
Genome Research, 6:791-806, 1996. First strand cDNA 
synthesis was primed with an oligo-dT primer containing a 
Not I site. Double stranded cDNA was ligated to an EcoR I 
adaptor, digested with Not I, and cloned directionally 
into pT7T3-Pac vector. The oligonucleotide used to prime 
the synthesis of first-strand cDNA contains a library tag 
sequence that is located between the Not I site and the 
(dT) 18 tail. The sequence tag for this library is 
ACACTTGCAC . 
TAG_LIB=UI-H-EI1 
TAG_TISSUE=chondrosarcoma 
TAG_S EQ = ACACTTGCAC " 
91 a 164 c 217 g 121 t 



Query Match 77.0%; Score 279.6; DB 12; Length 593; 

Best Local Similarity 93.9%; Pred. No. 1.8e-60; 

Matches 291; Conservative 0; Mismatches 19; Indels 0; Gaps 



0; 



Qy 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 



Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



465 



114 



II IMIIIIIIIIIIIIIIIMIIIIIII IIIMIIIIIIIII llllllll Mill 

CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 



406 



173 



GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

IIIIIMM II II 1 1 1 1 1 1 1 1 1 1 1 1 1 il 1 1 1 1 1 llllllll llllllll Mill 

4 05 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 34 6 



174 



354 



233 



CATCTGGAGCCTAGAC^CTCAGTACAACAAGGTCCTGGCTCTGTACAACaVGCACAACCC 

III IIIIIMI MMI Ml. F 1 1 1 f 1 1 1 1 1 f I IIIMMMIMM Mill 

34 5 CATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC 28 6 



234 



293 



GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 

llllll IMMIIIIIIMIIIIIIIIMIIIIIIIMMIIM 1 1 1 1 1 1 1 II 1 1 1 1 1 

2 8 5 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 226 



294 



353 



CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 

I I I I I I , I i I i I I I I ! I I I I I I I I M I i I I I I ; I I I I 1 1 I I 1 I I I ! 1 1 llllllll 

22 5 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 166 



363 



GTGCAGCTGA 

MM M 
165 GTGCAGCTGA 156 



RESULT 15 

AI951831/C 

LOCUS 

DEFINITION 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



FEATURES 

source 



AI951831 598 bp mRNA linear EST 06-SEP-1999 

wx38b08.xl NCI_CGAP_Pitl Homo sapiens cDNA clone IMAGE : 2545911 3' 
similar to gb :X02812_cdsl TRANSFORMING GROWTH FACTOR BETA 1 
PRECURSOR (HUMAN) /contains PTR7 . t3 MER22 repetitive element ;, mRNA 
sequence . 
AI951831 

AI951831 . 1 GI :5744141 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 598) 

NCI -CGAP http: //www.ncbi .nlm.nih.gov/ncicgap. 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 

Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs -r@mail .nih.gov 

Tissue Procurement: Chris Moskaluk, M.D., Ph.D., Michael R. 
Emmert-Buck, M.D., cDNA Library Preparation: Life Technologies, 
Inc. cDNA Library Arrayed by: Chris ta Prange, The I.M.A.G.E. 
Consortium DNA Sequencing by: Washington University Genome 
Sequencing Center 

Clone distribution: NCI -CGAP clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
www-bio. llnl . gov/bbrp/ image/ image. html 
Seq primer: -4 0UP from Gibco 
High quality sequence stop: 425. 

Location/Qualifiers 

1. .598 



BASE COUNT 
ORIGIN 



/organ ism= "Homo sapiens" 
/ mo 1 _ t yp e = " mRN A ■■ 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 254 5911" 

/tissue__type=" four pooled pituitary adenomas" 

/lab_host="DH10B" 

/clone_lib="NCI_CGAP_Pitl" 

/note="Organ-. brain; Vector: pCMV-SPORT6 ; Site_l: Sail; 
Site_2 : Not I; Cloned unidirectionally . Primer: Oligo dT. 
Library constructed by Life Technologies." 
95 a 173 c 222 g 107 t 1 others 



Query Match 77 . 0%; 

Best Local Similarity 93.9%; 
Matches 2 91; Conservative 



Score 279.6; DB 9; 
Pred. No. l.8e-60; 
0; Mismatches 19; 



Length 5 98; 
Indels 0; 



Gaps 



0; 



Qy 



Db 



54 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 113 

II III I II 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 il ! I 1 1 1 !l 'II I II 1 1 1 1 1 ! 1 1 1 1 1 Mill 

44 7 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 38 8 



Qy 

Db 

Qy 

Db 



114 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 

MINIMI II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMI I MM MMI 

387 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 



174 



327 



C^TCTGGAGCCTAGAC^CTC^GTACAAC^GGTCCTGGCTCTGTAa^CC^GaVQ^CCC 

III IMMMI Mill MIMM MIMMIMM 1 1 1 MMI 

(^TTTGGAGCCTGGACACGC^GTAC^Ga^GGTCCTGGCCCTGTACAACCAGCATAACCC 



173 



328 



233 



268 



Qy 



Db 



234 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 293 

MUM I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 67 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 08 



Qy 

Db 

Qy 
Db 



294 



207 



354 



147 



CTACGTGGGCCGCAAGCC(^GGTGGAGC^GCTGTCCAACATGATCGTGCGTTCCTGCAA 353 

i f 1 1 1 r 1 1 j i f m 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 MiMiii 

CTACGTGGGCCGC^GCCC^AGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAA 14 8 



363 



GTGCAGCTGA 

Mill MM 

GTGCAGCTGA 138 



Search completed: October 28, 2003, 00:08:15 
Job time : 902.804 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein 
Run on: 

Title: 
Perfect score; 685 



protein search, using sw model 
October 27, 2003, 19:11:58 

US-10-017-372E-15 



; Search time 10.0108 Seconds 

(without al ignments ) 

1902.657 Million cell updates/sec 



Sequence : 
Scoring table: 

Searched : 



1 ALDTNYCFSSTDYKDDDDKE GRKPKVEQLSNMI VRSCKCS 120 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1107863 



Database : 



A Geneseq_19Jun03 : ■ 



1 
2 
3 
4 

5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 



/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1980 . DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1981 .DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1982 . DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1983 . DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1984 .DAT 
IS I DS 1 /gcgda t a /genes eq/genes eqp - embl /AA1 98 5. DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1986 .DAT 
/SIDS1 /gcgdata/geneseq/geneseqp-embl/AA1987 . DAT 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1988 . DAT 
/SlDSl/gcgdata/geneseq/geneseqp-embl/AA1989 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1990 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1991 .DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1992 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1993 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1994 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1995 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1996 .DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1997 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1998 .DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA1999 . DAT: 
/ S I DS 1 /gcgda t a /gene s eq/ genes eqp - embl / AA2 00 0. DAT : 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA2001 . DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA2 002 .DAT: 
/SIDSl/gcgdata/geneseq/geneseqp-embl/AA2003 .DAT: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 


Score 


Match 


Length 


DB 


ID 


Description 


1 
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91 


. 1 


112 


11 


AAR04076 


Sequence of mature 


2 


624 


91 


.1 


112 


11 


AAR08142 


Platelet-derived h 


3 


624 


91 


. 1 


112 


14 


AAR43263 


TGF-beta. Homo sa 


4 


624 


91 


. 1 


112 


14 


AAR42311 


Recombinant human 


5 
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91 


. 1 


112 


17 


AAR92773 


Human TGF-beta 1. 


6 
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91 


. 1 


112 


17 


AAR91956 


Human transforming 


7 
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91. 


. 1 


112 


18 


AAW08173 


TGF-betal active f 


8 


624 


91. 


. 1 


112 


19 


AAW78781 


Human transforming 


9 


624 


91 


. 1 


112 


20 


AAY08299 


Human growth facto 


10 


624 


91 


. 1 


112 


20 


AAW97091 


The mature form of 


11 


624 


91 . 


. 1 


112 


20 


AAW84 2 07 


- - . J _J> w 


12 


624 


91. 


.1 


112 


21 


AAY92010 


Human transforming 


13 


624 


91 . 


. 1 


112 


21 


AAY67950 


Human transforming 


14 


624 


91 , 


. 1 


112 


22 


AAB35937 


TGF-beta 1 amino a 


15 


624 


91. 


. 1 


112 


23 


AAM51939 


Human TGFbeta prot 


16 


624 


91 . 


. 1 


112 


24 


ABU08656 


Human transforming 


17 


624 


91. 


. 1 


115 


22 


AAB73204 


TGF-betal C-termin 


18 


624 


91. 


, 1 


115 


24 


ABG76033 


Human TGF-beta 1 C 


19 
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91 . 


. 1 


118 


18 


AAW08176 


TGF-beta fusion pr 


20 


624 


91. 


. 1 
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18 


AAW08181 


TGF-beta fusion pr 


21 


624 


91. 


. 1 
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18 


AAW08177 


TGF-beta fusion pr 


22 


624 


91 . 


. 1 
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18 


AAW0818 8 


TGF-beta fusion pr 


23 


624 


91 . 


. 1 


130 


18 


AAW08185 


TGF-beta fusion pr 


24 


624 


91 . 


, 1 


131 


18 


AAW08182 


TGF-beta fusion pr 


25 


624 


91 . 


. 1 


132 


18 


AAW0818 9 


TGF-beta fusion pr 


26 


624 


91. 


. 1 


133 


18 


AAW08178 


TGF-beta fusion pr 


27 


624 


91 . 


. 1 


134 


18 


AAW08187 


TGF-beta fusion pr 


28 


624 


91. 


. 1 


134 


18 


AAW08190 


TGF-beta fusion pr 


29 


624 


91 . 


1 


135 


18 


AAW08183 


TGF-beta fusion pr 


30 


624 


91. 


1 


136 


18 


AAW08186 


TGF-beta fusion pr 


31 


624 


91 . 


. 1 


137 


18 


AAW0818 0 


TGF-beta fusion pr 


32 


624 


91 . 


1 


137 


18 


AAW08184 


TGF-beta fusion pr 


33 


624 


91. 


1 
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18 


AAW08179 


TGF-beta fusion pr 


34 


624 


91. 


. 1 
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7 


AAP61468 


PreTGF-beta gene p 


35 


624 


91. 


1 
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11 


AAR04034 


Sequence of pre-TG 


36 


624 


91 . 


1 
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11 


AAR05258 


Human pre- trans for 


37 


624 


91. 


1 


390 


12 


AAR13813 


Human pro -TGF-beta 


38 


624 


91 . 


1 


390 


13 


AAR20124 


Sequence of simian 


39 


624 


91 . 


1 


390 


15 


AAR46227 


Human pre-TGF-beta 


40 


624 


91 . 


1 


390 


16 


AAR73596 


Human TGF-beta 1 p 


41 


624 


91. 


1 


390 


17 


AAR90827 


Pre-transforming g 


42 


624 


91. 


1 


390 


19 


AAW78785 


Human pre- trans for 


43 


624 


91 . 


1 


390 


22 


AAM39186 


Human polypeptide 


44 


624 


91 . 


1 


390 


23 


AAU77101 


Human transforming 


45 


624 


91 . 


1 


390 


23 


AAE13596 


Porcine transformi 



ALIGNMENTS 



RESULT 1 
AAR04076 

ID AAR04076 standard; protein; 112 AA. 
XX 

AC AAR04076; 
XX 

DT 25-MAR-2003 (updated) 

DT 31-OCT-2002 (updated) 

DT 31-MAY-1989 (first entry) 
XX 

DE Sequence of mature human TGF-beta 1. 
XX 

KW Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 

KW inhibition. 

XX 

OS Homo sapiens. 
XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 88WO-US01945 . 
XX 

PR 08-JUN-1988; 88WO-US01945 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 used as a 

PT probe, or to produce TGF beta 3, for inhibiting growth of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. 3; 61pp; English. 
XX 

CC This sequence is that of mature human transforming growth factor-beta 1 

CC (TGF-beta 1) polypeptide. It differs from human and porcine TGF-beta 

CC subtype 3 sequences at positions 9-11,13,19,40,45,52,57,58,60,63,67,68, 

CC 71 , 75 , 82 , 87 , and 95. In addition porcine TGF-beta 3 has an AA substitution 

CC at posn. 96. The N- terminal AA sequence of bovine TGF-beta 2 differs from 

CC that of human TGF-beta 1 at posns . 4 , 5 , 9- 13 , and 19. 

CC The nucleic acid sequence encoding the second subtype of TGF-beta (TGF- 

CC beta 3) is useful as a probe or to produce TGF-beta 3 for both normal and 

CC neoplastic cell growth inhibition. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 11; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 



Illllllllll I II I MM. I MM 1 1 II. ,M II: III II 1 1 Mill ill 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

MMMMMMMMIMMMMMMMMMMMMI MIMMMMMMMI 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 112 

RESULT 2 
AAR08142 

ID AAR08142 standard; protein; 112 AA. 
XX 

AC AAR08142; 
XX 

DT 25-MAR-2003 (updated) 

DT 28-FEB-1991 (first entry) 

XX 

DE Platelet -derived human TGF-beta monomer. 
XX 

KW Transforming growth factor; cartilages inducing factor; CIF; 

KW polycythemia; thrombocytosis; slpenomegaly . 

XX 

OS Homo sapiens. 
XX 

PN US4971952-A. 
XX 

PD 20-NOV-1990. 
XX 

PF 27-OCT-1988; 8 8US- 0263635 . 
XX 

PR 27-OCT-1988; 88US- 0263635 . 
XX 

PA (CLGE ) COLLAGEN CORP. 
XX 

PI Bentze H, Ellingsworth L, Armstrong R; 
XX 

DR WPI; 1990-368150/49. 
XX 

PT Treating inflammation assosiated with transplants, etc. - using 

PT cartilage inducing factor polypeptide homodimer cpd. 

XX 

PS Disclosure; Fig 1; 14pp; English. 
XX 

CC The TGF-beta may be used to derive cartilage inducing factors (CIFs) 

CC useful for inhibition of acute/chronic inf lammat ion, treatment of red 

CC blood cell dysfunction associated with polycythemia, thrombocytosis 

CC or splenomegaly. 

CC (Updated on 25-MAR-2003 to correct PR field.) 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 11; Length 112; 
Best Local Similarity 93.3%; Pred . No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MMMMMI I II M IIMMIIMM I II MM MMM Ml IMM III 



Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYS KVLALYNQHN PGASAAPCCVPQALE PLP I VYY VGRKPKVEQLSNM I VRS CKCS 12 0 

IMIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIMIMIIIIIIIIIIMIIII 

Db 53 SLDTQYS KVLALYNQHN PGASAAPCCVPQALE PL PI VYY VGRKPKVEQLSNM I VRS CKCS 112 



RESULT 3 
AAR43263 

ID AAR43263 standard; protein; 112 AA. 
XX 

AC AAR43263; 
XX 

DT 25-MAR-2003 (updated) 

DT 27-APR-1994 (first entry) 

XX 

DE TGF-beta. 
XX 

KW TGF; transforming growth factor; scar formation; CNS; 

KW central nervous system; infusion; antagonist; brain; 

KW spinal cord; regeneration; decorin; Arg-Gly-Asp. 
XX 

OS Homo sapiens. 
XX 

PN W09319783-A1. 
XX 

PD 14-OCT-1993. 
XX 

PF 01-APR-1993; 93WO-US03068. 
XX 

PR 01-APR-1992; 92US-0860704 . 
XX 

PA (WHIT- ) WHITTIER INST DIABETES & ENDOCRINOLOGY. 
XX 

PI Baird A, Logan A; 
XX 

DR WPI; 1993-336598/42. 
XX 

PT Inhibiting or enhancing scar formation in the CNS - by infusion 

PT of transforming growth factor-beta antagonists, e.g. 

PT anti- (TGF-beta) -antibodies, or TGF-beta itself, respectively 

XX 

PS Disclosure; Page 5; 32pp; English. 
XX 

CC The mature form of TGF-beta is comprised of two identical chains, 

CC each of 112 amino acids. Since TGF-beta is a regulator of scar 

CC prodn. after a penetrating injury to the brain or spinal cord, which 

CC scar prodn. can preclude neuronal recovery, TGF-beta antagonists 

CC have potential use as adjuncts to those therapies designed to 

CC promote regeneration and reconnection of damaged neural pathways. 

CC (Updated on 25-MAR-2003 to correct PN field.) 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 14; Length 112; 

Best Local Similarity 93.3%; Pred . No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

IIIIIIIIMI I I I I I I I I , I I I I 1 I I I I ' ! I I I ' I I [ I I I I I I I M 

Db 1 ALDTNYCFSST EKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II II I I I I I I I I I I I I I I I II I I 
Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 



RESULT 4 
AAR42311 

ID AAR42311 standard; peptide; 112 AA. 
XX 

AC AAR42311; 
XX 

DT 01-APR-1994 (first entry) 
XX 

DE Recombinant human TGF-beta. 
XX 

KW Transforming growth factor beta; stable; treatment; wounds; 

KW fractures; inflammation; autoimmune disease; tumours. 

XX 

OS Homo sapiens (recombinant) . 
XX 

PN JP05252956-A. 
XX 

PD 05-OCT-1993. 
XX 

PF ll-JUL-1991; 91 JP- 017128 0 . 
XX 

PR ll-JUL-1991; 91 JP- 0 1 7 12 8 0 . 
XX 

PA (EART ) EARTH SEIYAKU KK. 
XX 

DR WPI; 1993-347572/44. 
XX 

PT Prodn. of recombinant cell for high expression of polypeptide 

PT such as human TGF-beta - for treatment of e.g. wounds, fractures, 

PT inflammation, auto-immune diseases or tumours 

XX 

PS Disclosure; Page 15; 18pp; Japanese. 
XX 

CC Human transforming growth factor (TGF) -beta can be produced in 

CC recombinant cells, and thus produced is stable in the presence of 50 

CC microg/ml methotrexate. TGF-beta may be produced in large amounts 

CC via recombinant means and has uses such a wound healing, treatment 

CC of fractures, inflammation, autoimmune disease or tumours. 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 14; Length 112; 
Best Local Similarity 93.3%; Pred . No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

IIIIIIIIMI IIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIMII 



Db 1 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIMIIIIIIIMIII 

Db 53 SLDTQYS KVLALYNQHN PGASAAPCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRS CKCS 112 



RESULT 5 
AAR92773 

ID AAR92773 standard; Protein; 112 AA. 
XX 

AC AAR92773; 
XX 

DT 17-JUL-1996 (first entry) 
XX 

DE Human TGF-beta 1. 
XX 

KW Transforming growth factor type beta; TGF-beta 1; 

KW protein renaturation; protein folding. 

XX 

OS Homo sapiens . 
XX 

PN WO9603433-A1 . 
XX 

PD 08-FEB-1996. 
XX 

PF 12-JUL-1995; 95WO-EP02 7 1 9 . 
XX 

PR 25-JUL-1994; 94EP - 08 1 043 9 . 
XX 

PA (CIBA ) CIBA GEIGY AG. 
XX 

PI Cerletti N; 
XX 

DR WPI; 1996-117000/12. 

DR N-PSDB; AAT17235. 
XX 

PT Prodn. of dimeric biologically active transforming growth factor 

PT by refolding denatured monomer in detergent- free folding buffer 

PT contg. specific organic solvent to improve yield 
XX 

PS Example IB; Page 29-30; 54pp; English. 
XX 

CC Non-soluble, monomeric transforming growth factor TGF-beta 1 

CC (AAR92773) was recovered from E. coli LC 137 (DSM 5658) transf ormants 

CC carrying plasmid pPLMu . TGF-beta 1, which includes the coding 

CC sequence (AAT17235) for TGF-beta 1. A biologically active, dimeric 

CC form of TGF-beta 1 was obtd. by refolding this monomer in detergent- 

CC tree buffer contg. DMSO and/or DMF . Dimers of TGF-beta 2 (AAR92774) 

CC and TGF-beta 3 (AAR92772), and hybrid dimers (see also AAR92775-77) , 

CC were also produced. 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 17; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MMIIIIIII , 1 1 1 1 1 1 1, 1. 1 i 1 1 1 1 M III 1 1 II II 1 1 1 1 1 1 1 1 Ml ! 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYS KVLALYNQHN PGASAAPCCVPQALE PLP I VYY VGRKPKVEQLSNM I VRSCKCS 120 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 

RESULT 6 
AAR91956 

ID AAR91956 standard; Protein; 112 AA. 
XX 

AC AAR91956; 
XX 

DT 10-JUN-1996 (first entry) 
XX 

DE Human transforming growth factor beta 1. 
XX 

KW Transforming growth factor beta; TGF ; regulator; method; 

KW proliferation; differentiation; wound healing; solvent. 
XX 

OS Homo sapiens. 
XX 

PN WO9603432-A1. 
XX 

PD 08-FEB-1996. 
XX 

PF 12-JUL-1995; 95WO-EP02718 . 
XX 

PR 25-JUL-1994; 94EP- 08 10438 . 
XX 

PA (CIBA ) CIBA GEIGY AG. 
XX 

PI Cerletti N; 
XX 

DR WPI; 1996-116999/12. 

DR N-PSDB; AAT15462. 
XX 

PT Prodn. of dimeric, biologically active transforming growth factor 

PT beta - by refolding denatured monomer in buffer contg. mild 

PT detergent and specific organic solvents to improve yields 
XX 

PS Example 1; Page 33; 5 9pp; English. 
XX 

CC AAR91956-R91958 are the amino acid sequences of human transforming 

CC growth factor (TGF) beta-1, TGF beta-2 and TGF beta-3 which are used to 

CC produce TGF beta-like proteins in dimeric form. The TGF beta-like 

CC proteins produced are hybrids of 2 different types of TGF beta e.g. 

CC TGF beta-1-3, TGF beta-2-3, etc, or bone morphogenic proteins e.g. 

CC BMP-2. The TGF beta hybrids were made using a new process of producing 

CC dimeric, biologically active TGF beta-like proteins. The new process 

CC involves treating denatured TGF beta monomers with folding buffer contg. 

CC a mild detergent (CHAPS , CHAPSO or digitonm) and at least one of the 

CC solvents DMSO (dimethyl sulphoxide) , DMS02 (dimethylsulphone) and DMF 

CC (dimethyl formamide) . The detergent allows folding of the monomer such 



CC that, after dimerisation, the TGF beta-like protein retains biological 

CC activity and remains in soluble form. The method allows relatively high 

CC yields of biologically active TGF beta-like proteins in their native 

CC dimeric form. TGF-beta like proteins are multifunctional regulators of 

CC cellular activity and a typical use is to stimulate wound healing. 
XX 

SQ Sequence 112 AA; 



Query Match 91.1%; Score 624; DB 17; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

lllllllllll llllllllllllllllllllllllllllllllllllllll 
Db 1 ALDTNYCFSST EKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYS KVLAL YNQHNPGASAA PCC VPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I !!!!!!!!!!!!!!!!!!!!!! j !!!!! MM! ! 

Db 53 SLDTQYSKVTALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 



RESULT 7 
AAW08173 

ID AAW08173 standard; Peptide; 112 AA. 
XX 

AC AAW08173; 
XX 

DT 26-AUG-1997 (first entry) 
XX 

DE TGF-betal active fragment of a TGF-beta fusion protein. 
XX 

KW Transforming growth factor-beta fusion protein; wound healing; 

KW artificial skin; surgery recovery time. 

XX 

OS Homo sapiens. 
XX 

PN WO9639430-A1 . 
XX 

PD 12-DEC-1996. 
XX 

PF 05-JUN-1996; 96WO-US08973 . 
XX 

PR 06-JUN-1995; 95US - 047 0837 . 
XX 

PA (CHEU/) CHEUNG D T. 

PA (HALL/) HALL F L. 

PA (NIMN/) NIMNI M E. 

PA (TUAN/) TUAN T. 

PA (WULL/) WU L. 
XX 

PI Cheung DT, Hall FL, Nimni ME, Tuan T, Wu L; 
XX 

DR WPI; 1997-043065/04. 

DR N-PSDB; AAT42771. 
XX 

PT Prepn. of transforming growth factor-beta fusion protein - useful to 

PT reduce surgery recovery time and to prepare artificial skin 



XX 

PS Claim 12; Page 45-46; 59pp; English. 
XX 

CC A novel transforming growth factor-beta (TGF-beta) fusion protein 

CC comprises a purification tag and a TGF active fragment. The present 

CC sequence represents a specifically claimed TGF-betal active fragment. 

CC Additionally, the fusion protein may comprise proteinase-sensitive 

CC linker sites and binding domain so the protein sequence may contain 

CC some or all of the following elements: purification tag : proteinase 

CC site:ECM binding site : proteinase site : TGF-beta . TGF-beta promotes 

CC wound healing, and the fusion protein can be used to reduce surgery 

CC recovery time and in the preparation of artificial skin. The inclusion 

CC of a purification tag facilitates purification of the fusion protein. 

CC The proteinase site is included to permit cleavage and release of the 

CC purification tag after purification if desired. The extracellular 

CC matrix binding site facilitates delivery of the fusion protein to the 

CC desired site of action. Delivery of the TGF-beta to the site to be 

CC treated reduces the amount of TGF-beta required to be admim'sfprpri to 

CC be effective and reduces the concentration of circulating TGF-beta 

CC which may result in undesirable effects. 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 18; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiiiii m 1 1 i 1 1 i i r 1 1 1 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 ; ii i ii i ii 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

'I IN 1 1 1 1 1 1 1 Ii II 1 1 1 1 II 1 1 1 1, 1 1 1 , II, 1 1 1; I Mill 1 1 1 1 1 M 1 11:1 1 1 II 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 



RESULT 8 
AAW78781 

ID AAW78781 standard; Protein; 112 AA. 
XX 

AC AAW78781; 
XX 

DT 25-MAR-2003 (updated) 

DT 21-DEC-1998 (first entry) 

XX 

DE Human transforming growth factor-beta 1 N- terminal sequence. 
XX 

KW Transforming growth factor-beta 1; TGF-beta 1; human. 
XX 

OS Homo sapiens. 
XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US-0454468 . 
XX 



PR 13-MAR-1987; 87US - 0025423 . 

PR 22-MAR-1985; 85US- 0715142 . 

PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US-0845893 . 

PR 05-NOV-1993; 93US-0147364 . 

PR 30-MAY-1995; 95US-0454468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Disclosure; Fig 3; 26pp; Enqlish. 
XX 

CC This is the amino acid sequence of human transforming growth factor 

CC beta-1 (TGF-beta 1, see also AAW78785) . The sequence shows homology 

CC to human TGF-beta 3 (see AAW78782), porcine TGF-beta 3 (see AAW78783) 

CC and bovine TGF-beta 2 (see AAW78784) N- terminal sequences. The 

CC invention relates to the recombinant production of TGF-beta. DNA 

CC encoding TGF-beta has been isolated and cloned into vectors which 

CC are replicated in bacteria and expressed in eukaryotic cells. 

CC TGF-beta recovered from transformed cells can be used in known 

CC therapeutic applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 19; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiiiii 1 1 1 1 1 1 ii ii ii i : 1 1 1 1 :i i ii 1 1 1 1 hi M i ii 1 1 1 ii i 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 .' II 1 1 M II , 1 1 1 1 1 1 1 1 1 1 Ihl II 1 1 II 1 1 1 II I 1 1 1 1 II 1 1! I II i, 1 1 1 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 

RESULT 9 
AAY08299 

ID AAY08299 standard; Protein; 112 AA. 
XX 

AC AAY08299; 
XX 

DT 14-JUL-1999 (first entry) 
XX 

DE Human growth factor protein fragment TGF-Betal . 
XX 

KW Growth factor; human; dimer; cysteine knot; cellular inclusion body; 

KW pharmaceutical. 



XX 

OS Homo sapiens. 
XX 

PN DE19748734-A1 . 
XX 

PD 06-MAY-1999. 
XX 

PF 05-NOV-1997; 97DE- 1048734 . 
XX 

PR 05-NOV-1997; 97DE- 1 04 8734 . 
XX 

PA (GBFB ) GES BIOTECHNOLOGI SCHE FORSCHUNG MBH . 
XX 

PI Erdmann H, Kaerst U, Mueller C, Rinas U, Weich H; 
XX 

DR WPI; 1999-278785/24. 
XX 

PT Preparing active growth factor dimers from inclusion bodies in high 

PT yield 

XX 

PS Claim 14; Page 12; 14pp; German. 
XX 

CC This invention describes the novel preparation of biologically active 
CC dimers of recombinant human growth factors of the cysteine knot family 
CC starting from cellular inclusion bodies. Such dimers are are useful in 
CC pharmaceutical compositions and the method provides yields of 31-39.7%, 
CC in examples, compared with about 10% for the conventional method (see 
CC Biochemistry, 28 (1989) 2956) . AAY08278 - Y08301 are human growth factor 
CC protein fragments used in the method of the invention. 
XX 

SQ Sequence 112 AA; 



Query Match 91.1%; Score 624; DB 20; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIUM! MIIIIIMMIIIIIIIMIIIIMIIIMIIIMIIIII 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

MMMM MMMMMMMMMMMMM MMMMMMMMMMMMM 

Db 53 SLDTQYS KVLALYNQHN PGASAAPCCVPQALEPLP I VY Y VGRKPKVEQLSNM I VRS CKCS 112 



RESULT 10 
AAW97091 

ID AAW97091 standard; Protein; 112 AA. 
XX 

AC AAW97091; 
XX 

DT 20-MAR-2003 (updated) 

DT 28-APR-1999 (first entry) 

XX 

DE The mature form of transforming growth factor-beta- 1 . 
XX 

KW Transforming growth factor-beta- 1 ; TGF-beta-like protein; 



KW S-sulphonated TGF-beta-like protein; wound treatment; cancer; 

KW bone repair; tissue repair; bone marrow protective agent; 

KW cardioprotection; ant i- inflammatory; immunosuppressive; 

KW ulcer; bed sore. 
XX 

OS Homo sapiens. 
XX 

PN EP891985-A1. 
XX 

PD 20-JAN-1999. 
XX 

PF 27-NOV-1990; 98EP- 0 1 134 8 7 . 
XX 

PR 06-DEC-1989; 8 9GB- 002754 6 . 

PR 27-NOV-1990; 90EP-0810922 . 
XX 

PA (NOVS ) NOVARTIS AG. 
XX 

PI Cerletti N, Cox D, McMaster GK, Meyhack B, Schmitz A; 
XX 

DR WPI; 1999-083520/08. 

DR N-PSDB; AAX15245. 
XX 

PT Producing biologically active dimeric Transforming Growth 

PT Factor-beta - by refolding new monomeric Transforming Growth 

PT Factor-beta, useful for treatment of wounds and cancer 
XX 

PS Example 1; Page 28; 32pp; English. 
XX 

CC The present sequence represents the mature form of transforming growth 

CC f actor-beta-1 . Dimeric, biologically active TGF-beta-like protein 

CC can be produced by subjecting the denatured monomeric form to refolding 

CC conditions. The new monomeric S-sulphonated TGF-beta-like protein is 

CC useful for the production of the dimeric, biologically active 

CC TGF-beta-like protein, which is useful for the treatment of wounds 

CC (surface or internal) and cancer in a mammal, in bone and tissue 

CC repair, as a bone marrow protective agent, a mediator of 

CC cardioprotection, for the production of an ant i - inflammatory or 

CC immunosuppressive preparation. Treatment is useful for animals, 

CC especially humans, and wound treatment (e.g. ulcers, bed sores etc.) is 

CC particularly useful for the elderly. 

CC (Updated on 20-MAR-2003 to correct PF field.) 

CC (Updated on 20-MAR-2003 to correct PR field.) 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 20; Length 112; 
Best Local Similarity 93.3%; Pred . No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

Illllllllll , ' : 1 1 1 1, 1 1 1 1 1 1 ii: I '1 1 1 1 1 ! ii 1 1 1 1: 1 1! 1,1 

Db 1 ALDTNYCFSST EKNCCVRQLY I DFRKDLGWKWI HE PKGYHANFCLG PCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 II 1 1 1 1 1 II Ml 1 1 1 1 , 1 1 1 1 1 h Ml 1 1 1 1 1 II; I II 1 1 1 1 1 II! 1 1 II I II . 'I! 

Db 53 SLDTQYSKVTiALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 112 



RESULT 11 
AAW84207 

ID AAW84207 standard; Protein; 112 AA. 
XX 

AC AAW84207; 
XX 

DT 25-MAR-1999 (first entry) 
XX 

DE Transforming growth factor beta active fragment. 
XX 

KW Proteinase site; bone morphogenet ic fusion protein; bone binding site; 

KW bone morphogenet ic protein; transforming growth factor beta; 

KW active fragment; wound healing; bone growth; purification tag. 
XX 

OS Homo sapiens. 
XX 

PN W09855137-A1. 
XX 

PD 10-DEC-1998. 
XX 

PF 02-JUN-1998; 98WO-US11 189 . 
XX 

PR 03-JUN-1997; 97US-0868452 . 
XX 

PA (HALL/) HALL F L. 

PA (HANB/) HAN B. 

PA (NIMN/) NIMNI M E . 

PA (SHOR/) SHORS E C. 

PA (WULL/) WU L. 

XX 

PI Hall FL, Han B, Nimni ME, Shors EC, Wu L; 
XX 

DR WPI; 1999-059875/05. 

DR N-PSDB; AAV99375. 
XX 

PT New bone morphogenetic fusion proteins - comprising a purification 

PT tag and a bone morphogenetic active fragment, used for enhancing 

PT wound healing or bone growth 
XX 

PS Example 1; Page 41; 64pp; English. 
XX 

CC The present sequence represents a transforming growth factor beta active 

CC fragment. The protein can be used in place of a bone morphogenetic 

CC active fragment to create the fusion proteins of the invention. When a 

CC bone morphogenetic active fragment is used, the fusion proteins are 

CC designated bone morphogenetic fusion proteins. The bone morphogenetic 

CC fusion protein may contain some or all of the following elements: a 

CC purification tag, a proteinase site, an ECM/bone binding site, a second 

CC proteinase site, and a bone morphogenetic protein active fragment. 

CC The bone morphogenetic fusion proteins can be used for enhancing wound 

CC healing or bone growth. 

XX 

SQ Sequence 112 AA; 



Query Match 



91.1%; Score 624; DB 20; Length 112; 



Best Local Similarity 93.3%; Pred . No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MIIIIMIII 1 1 1.' 1 1 1 1 "1 1 1 1 .' II I II ! ll I II I II I ! 1 1 1 II 1 1 !l I 

Db 1 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 II I II I M 1 1 1 :M 1 1 M 1 1 I Ml i . 1 1 1 h II 1 1 > 1 1 1 1 Ml M 1 1 1 1 1 1 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 

RESULT 12 
AAY92010 

ID AAY92010 standard; Protein; 112 AA. 
XX 

AC AAY92010; 
XX 

DT 19-JUL-2000 (first entry) 
XX 

DE Human transforming growth factor beta 1 monomer. 
XX 

KW human transforming growth factor beta 1 monomer; CKGF; mutant; 

KW cystine knot growth factor; hairpin loop; infertility. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Misc-dif f erence 1..20 

FT /note= "optionally mutated to increase electrostatic 

FT interaction between beta hairpin structure and 

FT a receptor" 

FT Domain 21 . .40 

FT /label= beta_hairpin_loop_l 

FT /note= "mutant optionally comprises one or more 

FT substitutions in these residues" 

FT Misc-dif f erence 41.. 81 

FT /note= "optionally mutated to increase electrostatic 

FT interaction between beta hairpin structure and 

FT a receptor" 

FT Domain 82 . . 102 

FT /label* beta_hairpin__loop_3 

FT /note= "mutant optionally comprises one or more 

FT substitutions in these residues" 

FT Misc-dif f erence 103.. 112 

FT /note= "optionally mutated to increase electrostatic 

FT interaction between beta hairpin structure and 

FT a receptor" 

XX 

PN WO200017360-A1 . 
XX 

PD 30-MAR-2000. 
XX 

PF 19-MAR-1999; 99WO-US05908 . 
XX 

PR 22-SEP-1998; 98WO-US19772 . 
XX 



PA (UYMA-) UN IV MARYLAND BALTIMORE. 
XX 

PI Weintraub BD, Szkudlinski MW; 
XX 

DR WPI; 2000-283585/24. 
XX 

PT New mutant cystine knot growth factor proteins comprising one or more 

PT mutant subunits, useful for treating or preventing diseases e.g. 

PT hypothyroidism and thyroid cancer 
XX 

PS Claim 193; Page 3 00; 32 0pp; English. 
XX 

CC This is the wild type human transforming growth factor beta 1 monomer. 

CC Mutants comprise at least one electrostatic charge altering mutation in a 

CC beta hairpin loop, resulting in increased bioactivity. 

CC Mutant cystine knot growth factor (CKGF) proteins comprising one or more 

CC mutant subunits and having novel properties or improved pharmacological 

CC properties, compared to wild type CKGFs , are claimed. The CKGF 

CC superfamily comprises at least four families of growth factors: the 

CC glycoprotein hormones, the platelet -derived growth factor (PDGF) family, 

CC the neurotrophins and the transforming growth factor-beta family; the 

CC families are known to be structurally similar (especially comprising the 

CC cystine knot topology) and it was shown that mutations at certain 

CC positions in the CKGF hairpin loops of family members and other members 

CC of the CKGF superfamily could significantly alter the biological 

CC activities of the CKGF. 

CC Mutant transforming growth factor family proteins or analogues are useful 

CC for treatment of ovulatory dysfunction, luteal phase defect, unexplained 

CC infertility, time-limited conception and in assisted reproduction. 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 21; Length 112; 

Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

1 1 1 1 1 1 1 . 1 1 1 ii . i 'i 1 1 1 1 1 1 1 1 m 1 1 1 1 1 ii i ii ; ii 1 1 1 1 1 1 1 1 1 1 ii 

Db 1 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

M '1 1 1 1 1 1 L I 1 1 1 1 1 1 1 1 M I II I II I III 1 1 1 1 1 II i I, II I II 1 1 ! 1 1 '1 1 1 1 II 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 112 

RESULT 13 
AAY67950 

ID AAY67950 standard; protein; 112 AA. 
XX 

AC AAY67950; 
XX 

DT 03-APR-2000 (first entry) 
XX 

DE Human transforming growth factor SEQ ID NO: 2. 
XX 

KW Human; transforming growth factor; Pax4 gene; expression; potentiator; 

KW insulin; pancreatic beta cell; diabetes. 



XX 

OS Homo sapiens. 
XX 

PN WO9966073-A1 . 
XX 

PD 23-DEC-1999. 
XX 

PF 15-JUN-1999; 99WO- JP03 182 . 
XX 

PR 16-JUN-1998; 98 JP- 0167976 . 
XX 

PA (YAMA ) YAMANOUCHI PHARM CO LTD. 
XX 

PI Ueda Y; 
XX 

DR WPI; 2000-097752/08. 
XX 

PT Screening potential Pax4 gene potentiators, used in treatment of, e.q. 

PT diabetes - 

XX 

PS Disclosure; Page 29; 38pp; Japanese. 
XX 

CC The present invention describes the a method for screening potential 

CC inhibitors of the expression of the Pax4 gene by contacting the 

CC potential inhibitor with pancreatic beta cells and measuring the 

CC expression of the gene in these cells is new. Substances identified 

CC by the screening method potentiate the expression of the Pax4 gene in 

CC pancreatic beta cells and accelerate the expression of insulin gene in 

CC those cells. The method can be used in the treatment of disorders in 

CC which the exhaustion of pancreatic beta cells is involved, such as 

CC diabetes. The present sequence represents the human transforming growth 

CC factor protein which is used in the exemplification of the present 

CC invention. 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 21; Length 112; 
Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 60 

1 1 ' , 1 1 1 1 1 1 1 1 1 1 1! I 1 1 1 1 1, 1 1 1 ! II I Ml I !l I II 1 1 1 ! I II I, I II 

Db 1 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

Mil Ml lllill I IMI'MII lllllli Mi Ih IIM'I III INI III I IIIIMi 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 

RESULT 14 
AAB35937 

ID AAB35937 standard; protein; 112 AA. 
XX 

AC AAB35937; 
XX 

DT 26-FEB-2001 (first entry) 
XX 



DE TGF-beta 1 amino acid sequence. 
XX 

KW Heparin binding; vascular graft; matrix; cell adhesion; growth factor; 

KW wound healing; dermal wound; wound healing; TGF-beta 1. 

XX 

OS Unidentified. 
XX 

PN WO200064481-A1 . 
XX 

PD 02-NOV-2000. 
XX 

PF 22-APR-1999; 99WO-IB00800 . 
XX 

PR 22-APR-1999; 99WO- IB00800 . 
XX 

PA (ETHZ-) ETH ZURICH & UNIV ZURICH. 
XX 

PI Sakiyama SE, Hubbell JA; 
XX 

DR WPI; 2001-024627/03. 
XX 

PT Matrix for controlled release of growth factor for wound healing, has 

PT substrate that attaches heparin binding peptide, protein growth factor 

PT that bind heparin with low affinity, and heparin or heparin-like 

PT polymer 
XX 

PS Example 5; Page 21; 4 8pp; English. 
XX 

CC This invention relates to a matrix comprising a substrate capable of 

CC providing attachment of a heparin binding peptide (HBP) , a peptide 

CC comprising a binding domain which binds heparin with high affinity, 

CC heparin or heparin-like polymer, and a protein growth factor or peptide 

CC fragment which has a domain that binds heparin with low affinity. 

CC Included in the invention is a vascular graft comprising the matrix, 

CC which is capable of supporting cell adhesion. The matrix is used for 

CC delivering low heparin binding affinity growth factor proteins or 

CC peptides in a controlled manner suitable for wound healing. The matrix 

CC can be used in an article for treating dermal wounds, and in an 

CC implantable sterilized composition capable of supporting cell adhesion. 

CC The present sequence represents a growth factor protein. The protein is 

CC used in an example illustrating that non-heparin-binding growth factors 

CC can be released in a controlled manner from heparin-based drug delivery 

CC systems based on their low affinity for heparin. 

XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 22; Length 112; 
Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIMI II I IIIMN 1 1 1 III III Mill IMMMI I IMMI III 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVXALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 1 1 [ 1 1 ! 1 1 : i ; 1 1 1 ; 1 1 1 1 , i , i , 1 1 1 1 1 1 1 1 1 1 1 1 1 ii [ 1 1 1 m i ; 1 1 1 1 1 1 1 1 1 ! 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 112 



RESULT 15 
AAM51939 

ID AAM51939 standard; protein; 112 AA . 
XX 

AC AAM51939; 
XX 

DT 01-FEB-2002 (first entry) 
XX 

DE Human TGFbeta protein superfamily protein TGFbetal. 
XX 

KW Human; TGFbeta; transforming growth factor beta; mutant; antagonist; 

KW agonist; ectopic bone formation; psoriasis; muscular atrophy; scar; 

KW formation; fibrosis; cirrhosis; osteopathic; antipsoriatic; 

KW ant if ibrotic; hepatotropic ; vulnerary; TGFbetal. 
XX 

OS Homo sapiens. 
XX 

PN DE10026713-A1 . 
XX 

PD 06-DEC-2001. 
XX 

PF 30-MAY-2000; 2000DE-1026713 . 
XX 

PR 30-MAY-2000; 2000DE-1026713 . 
XX 

PA (SEBA/) SEBALD W. 
XX 

PI Sebald W, Nickel J; 
XX 

DR WPI; 2002-042559/06. 
XX 

PT New mutein of transforming growth factor-beta superfamily protein, 

PT useful for treating or preventing e.g. ectopic bone formation, competes 

PT for receptor binding 

XX 

PS Disclosure; Fig 6; 54pp; German. 
XX 

CC The present invention relates to muteins of a chain of a protein which, 

CC when in the form of a homodimer, has antagonistic or partial agonistic 

CC activity, and where the mutation results in the protein binding with low 

CC affinity to its receptor. The protein is a member of the transforming 

CC growth factor beta (TGFbeta) superfamily. The mutant sequences of the 

CC invention can be used in the treatment of diseases associated with the 

CC overexpression of TGFbeta family proteins, including ectopic bone 

CC formation, psoriasis, muscular atrophy, scar formation, fibrosis and 

CC cirrhosis. The present sequence is the human TGFbetal protein. 
XX 

SQ Sequence 112 AA; 

Query Match 91.1%; Score 624; DB 23; Length 112; 
Best Local Similarity 93.3%; Pred. No. 1.9e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

lllllllllll IMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIII 



Db 



1 ALDTN YCF S ST EKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 



Qy 
Db 



61 SLDTQYS KVLAL YNQHN PGASAAPCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRSCKCS 120 

MIIIIIIIIIIMMMIIIMIIIMIMIIIMIIMIIMIIIIIIIIIIIIIIII 

53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 112 



Search completed: October 28, 2003, 09:06:47 
Job time : 10.0108 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 

Perfect score: 
Sequence : 



October 28, 2003, 09:09:54 ; Search time 7.28061 Seconds 

(without alignments) 
2760.110 Million cell updates/sec 

US-10-017-372E-15 
685 

1 ALDTNYCFSSTDYKDDDDKE GRKPKVEQLSNM I VRSCKCS 120 



Scoring table: BL0SUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



629382 seqs, 167460630 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



629382 



Database : 



Publ ished_Appl icat ions_AA : * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:* 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 

4 : / cgn2_6 /pt oda t a / 2 /pubpaa /US 0 6_PUBCOMB . pep : * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB . pep : * 

6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:* 

7 : /cgn2__6/ptodata/2/pubpaa/US08_NEW__PUB.pep: * 

8 : / cgn2_6 /pt oda t a / 2 /pubpaa /US 0 8_PUBCOMB . pep : * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 
1 0 : /cgn2_6 /pt odata /2 /pubpaa/US 0 9B_PUBCOMB . pep : 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep 
14 : /cgn2__6/ptodata/2/pubpaa/US10B_PUBCOMB.pep 
15 : /cgn2__6/ptodata/2/pubpaa/US10C_PUBCOMB.pep 
16 : /cgn2_6/ptodata/2/pubpaa/USl 0_NEW_PUB . pep : 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: 
1 8 : /cgn2_6 /ptodata 1 2 /pubpaa/US6 0_PUBCOMB . pep : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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.7 


114 


15 


us- 


10 


-154 


-333-27 


Sequence 


27, 


Appl 


32 


491 


71, 


.7 


382 


11 


us- 


09 


-214 


-592-34 


Sequence 


34, 


Appl 


33 


490 


71 , 


.5 


112 


10 


us- 


09 


-813 


-271B-12 


Sequence 


12, 


Appl 


34 


489 


71 


.4 


383 


10 


us- 


09 


-756 


-283A-27 


Sequence 


27, 


Appl 


35 


486 


70 


. 9 


112 


10 


us- 


09 


-813 


-271B-6 


Sequence 


6, 


Appli 


36 


486 


70 


. 9 


113 


10 


us- 


09 


-813 


-398-15 


Sequence 


15, 


Appl 


37 


486 


70 


. 9 


114 


10 


us- 


09 


-813 


-459-24 


Sequence 


24, 


Appl 


38 


486 


70 


. 9 


114 


14 


us- 


10 


-115 


-406-23 


Sequence 


23, 


Appl 


39 


486 


70 . 


. 9 


114 


15 


us- 


10 


-154 


-333-25 


Sequence 


25, 


Appl 


40 


486 


70 


. 9 


115 


10 


us- 


09 


-880 


-708-27 


Sequence 


27, 


Appl 


41 


486 


70 , 


. 9 


410 


11 


us- 


09 


-214 


-592-22 


Sequence 


22, 


Appl 


42 


486 


70 , 


, 9 


412 


11 


us- 


09 


-214 


-592-19 


Sequence 


19, 


Appl 


43 


486 


70, 


.9 


412 


14 


us- 


10 


-028 


-158-21 


Sequence 


21, 


Appl 


44 


485 


70, 


,8 


412 


11 


us- 


09 


-214 


-592-24 


Sequence 


24, 


Appl 


45 


483 


70. 


. 5 


412 


11 


us- 


09 


-214 


-592-31 


Sequence 


31, 


Appl 



ALIGNMENTS 



RESULT 1 

US-09-813-271B-2 

; Sequence 2, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION : 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US200201 15834Alart is Patent Department 
STREET: 564 Morris Avenue 
CITY: Summit 
/ STATE: New Jersey 

COUNTRY: USA 
ZIP: 07901 
; COMPUTER READABLE FORM : 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25 -Jul -94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4 - 2 003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION : 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 112 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
US-09-813-271B-2 

Query Match 91.1%; Score 624; DB 10; Length 112; 

Best Local Similarity 93.3%; Pred . No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IMMIMIII I M I M M M M 1 1 1 M 1 1 1 1 1 1 M M 11 11 II 1 1 11 ! M I 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 



Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMIVRSCKCS 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 
Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 112 



RESULT 2 

US-09-813-398-13 

Sequence 13, Application US/09813398 
Patent No. US20020169292A1 
GENERAL INFORMATION: 
APPLICANT: Bruce D. Weintraub 
APPLICANT: Mariusz W. Szkudlinski 
APPLICANT: University of Maryland 

TITLE OF INVENTION: CYSTINE KNOT GROWTH FACTOR MUTANTS 
FILE REFERENCE: UOFMD.003C1 

CURRENT APPLICATION NUMBER : US/09/8 13 , 3 98 
CURRENT FILING DATE: 2001-03-20 
PRIOR APPLICATION NUMBER: PCT/US99/05908 
PRIOR FILING DATE: 1999-03-19 
PRIOR APPLICATION NUMBER : PCT/US98/19772 
PRIOR FILING DATE : 1998-09-22 
NUMBER OF SEQ ID NOS : 41 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 13 
LENGTH: 113 
TYPE: PRT 

ORGANISM: HOMO SAP I EN 
US-09-813-398-13 

Query Match 91.1%; Score 624; DB 10; Length 113; 

Best Local Similarity 93.3%; Pred . No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1, 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

iiiiiiiiiii ii 1 1 1 ii 1 1 1 1 1 1: i m 1 1 1 1 ii 1 1 1 1 1 1 1 ii 1 1 1 M 

Db 2 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 53 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMIVRSCKCS 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMI 
Db 54 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 113 



RESULT 3 

US-09-813-459-22 

; Sequence 22, Application US/09813459 
; Patent No. US20020107369A1 
GENERAL INFORMATION: 

APPLICANT: Lee, Se-Jin 

Cunningham, No. US20020107369Aleen 
TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR- 10 
NUMBER OF SEQUENCES: 26 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Spensley Horn Jubas & Lubitz 
STREET: 1880 Century Park East, Suite 500 
CITY: Los Angeles 
STATE: California 
COUNTRY: USA 



ZIP: 90067 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/0 9/813,4 5 9 

FILING DATE: 20-Mar-2001 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/624,635 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: Wetherell, Jr., Ph.D., John R., 

REGISTRATION NUMBER: 31,678 

REFERENCE/DOCKET NUMBER: PD-3054 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619) 455-5100 

TELEFAX: (619) 455-5110 
INFORMATION FOR SEQ ID NO: 22: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 114 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
FEATURE : 

NAME /KEY : Protein 

LOCATION: 1..114 
SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
US-09-813-459-22 



Query Match 91.1%; Score 624; DB 10; Length 114; 

Best Local Similarity 93.3%; Pred. No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIIIIIIMI I ' !' I I 

Db 3 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 54 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 1 1 1< I II > 1 1 1 'i II I M 1 1 ! M 1 1 , 1 II 1 1 1 1 1 ,1 1 1 1, 1 II I j 1 1 1 1 1 'I IN 

Db 55 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 114 



RESULT 4 

US-10-115-406-21 

; Sequence 21, Application US/10115406 
; Publication No. US20020127612A1 
; GENERAL INFORMATION: 

; APPLICANT: THE JOHNS HOPKINS UNIVERSITY SCHOOL OF MEDICINE 
; APPLICANT: LEE, Se-Jm 

; TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-9 
; FILE REFERENCE: JHU1190-3 



; CURRENT APPLICATION NUMBER : US/10/115,406 

CURRENT FILING DATE : 2002-04-02 

PRIOR APPLICATION NUMBER : 09/301,520 

PRIOR FILING DATE: 1999-04-28 
; PRIOR APPLICATION NUMBER: US 09/172,062 

PRIOR FILING DATE: 1998-10-13 
; PRIOR APPLICATION NUMBER: US 08/491,835 
; PRIOR FILING DATE: 1995-10-23 
; PRIOR APPLICATION NUMBER : PCT/US94/00685 

PRIOR FILING DATE: 1994-01-12 

PRIOR APPLICATION NUMBER: US 08/003,303 

PRIOR FILING DATE: 1993-01-12 
; NUMBER OF SEQ ID NOS : 28 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 21 
LENGTH: 114 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-115-406-21 

Query Match 91.1%; Score 624; DB 14; Length 114; 

Best Local Similarity 93.3%; Pred. No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 60 

lllllllllll IIIIIMIMIIIIIIIIIIIIMIIMIIMIIIIIIIII 

Db 3 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 54 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 120 

MIIIL'MII ,'i 1 1 1 1 1 1 1 1 1 1 :l 1 i 1 1 1 ! I ( , , , 1 1 1 II i 'I I II 1 M | ; Ml || 1 1| 

Db 55 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 114 



RESULT 5 

US-10-154-333-23 

; Sequence 23, Application US/10154333 
; Publication No. US20030109684A1 
; GENERAL INFORMATION: 

APPLICANT: JOHNS HOPKINS UNIVERSITY 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-3 
NUMBER OF SEQUENCES: 2 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SPENSLEY HORN JUBAS & LUBITZ 

STREET: 188 0 CENTURY PARK EAST, FIFTH FLOOR 

CITY: LOS ANGELES 

STATE: CALIFORNIA 

COUNTRY: US 

ZIP: 90067 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER : US/ 10/ 154 , 333 

FILING DATE: 21-May-2002 

CLASSIFICATION: <Unknown> 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/3 8 9,705 

FILING DATE: 03-Sep-1999 
; APPLICATION NUMBER: 09/153,733 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
; NAME: WETHERELL , JR. Ph.D., JOHN R. 

REGISTRATION NUMBER: 31,678 

REFERENCE/DOCKET NUMBER: FD22 79 PCT 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619) 455-5100 

TELEFAX: (619) 455-5110 
INFORMATION FOR SEQ ID NO: 23: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 114 amino acids 

TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta 1 
FEATURE : 

NAME /KEY : Protein 

LOCATION: 1. .114 
SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
US-10-154-333-23 

Query Match 91.1%; Score 624; DB 15; Length 114; 

Best Local Similarity 93.3%; Pred. No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIMIIIIIII ! I M I II h I II 1 1 I III i Ml 1 1 1 1 1 1 1 1 1 1 1 i 1 1! Ill 

Db 3 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 54 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 II 1 1 1 1 II Ml II: ! I II 1 1 ! 1 1 1 1 M 1 1 1 ' 1 1 1 II I , II 1 1 1 1 1 II 1 1 1 1 1 I I Ml 

Db 55 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 114 



RESULT 6 

US-09-859-211-47 

; Sequence 47, Application US/09859211 

; Patent No. US20020157125A1 

; GENERAL INFORMATION: 

; APPLICANT: Lee, Se-Jin 

,- APPLICANT: McPherron, Alexandra C. 

; TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-8 

; FILE REFERENCE: 07265/144001 

; CURRENT APPLICATION NUMBER : US/ 0 9/ 859 , 211 

; CURRENT FILING DATE: 2001-05-15 

; PRIOR APPLICATION NUMBER: 09/019,070 

PRIOR FILING DATE: 1998-02-05 

PRIOR APPLICATION NUMBER: 08/862,445 
; PRIOR FILING DATE: 1997-05-23 

PRIOR APPLICATION NUMBER: 08/847,910 

PRIOR FILING DATE: 1997-04-28 



; PRIOR APPLICATION NUMBER: 08/795,071 
; PRIOR FILING DATE : 1997-02-05 

PRIOR APPLICATION NUMBER: 08/525,596 
; PRIOR FILING DATE: 1995-10-26 
; PRIOR APPLICATION NUMBER: PCT/US94 / 03 01 9 
; PRIOR FILING DATE: 1994-03-18 
; PRIOR APPLICATION NUMBER: 08/033,923 
; PRIOR FILING DATE: 1993-03-19 
; NUMBER OF SEQ ID NOS : 51 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 47 

LENGTH: 115 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-859-211-47 



Query Match 91.1%; Score 624; DB 10; Length 115; 

Best Local Similarity 93.3%; Pred. No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

I II II M ! 1 1 1 1 1 1 1 M 1 1 1 !! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 4 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 55 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 , 1 1 III II I ,' 1 1 1 1 1 1 II 1 1 1 1 II I Ml I Ml I! 1 1 1 

Db 56 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 115 



RESULT 7 

US-09-880-708-25 

; Sequence 25, Application US/09880708 
; Patent No. US20020165361A1 
GENERAL INFORMATION: 

APPLICANT: Lee, Se-Jin 
Huynh , Thanh 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-5 
NUMBER OF SEQUENCES: 28 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Gray Cary Ware & Freidenrich LLP 

STREET: 4365 Executive Drive, Suite 1600 

CITY: San Diego 

STATE: CA 

COUNTRY: USA 

ZIP: 92121-2189 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Windows95 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/880,708 

FILING DATE : 12-Jun-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/145,060 

FILING DATE: <Unknown> 

APPLICATION NUMBER: 08/003,144 



FILING DATE: 12-JAN-1993 
ATTORNEY / AGENT INFORMATION: 

NAME: Lisa A. Haile, Ph.D. 
REGISTRATION NUMBER: 38,347 
REFERENCE/DOCKET NUMBER: 07265/057002 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 858/677-1456 
TELEFAX: 619/677-1465 
INFORMATION FOR SEQ ID NO: 25: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 115 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
US-09-880-708-25 

Query Match 91.1%; Score 624; DB 10; Length 115; 

Best Local Similarity 93.3%; Pred . No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

1 1 1 II II 1 1 II I M I M M 1 1 1 M M 1 1 1 1 M II M M M I M M M M M I 

Db 4 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 55 

Qy 61 SLDTQYSKVXiALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIMIIIMI 

Db 56 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 115 



RESULT 8 

US-09-872-856-47 

; Sequence 47, Application US/09872856 
; Publication No. US20030074680A1 
; GENERAL INFORMATION: 

APPLICANT: Johns Hopkins University School of Medicine 
; APPLICANT: Lee, Se-Jin 

APPLICANT: McPherron, Alexandra 
; TITLE OF INVENTION: Growth Differentiation Factor-8 
; FILE REFERENCE: JHU1120-17 

; CURRENT APPLICATION NUMBER: US/09/872 , 856 

; CURRENT FILING DATE: 2001-06-01 

; PRIOR APPLICATION NUMBER: US 09/124,180 

; PRIOR FILING DATE: 1998-07-28 

; PRIOR APPLICATION NUMBER: US 09/019,070 

; PRIOR FILING DATE: 1998-02-05 

; PRIOR APPLICATION NUMBER: US 08/862,445 

PRIOR FILING DATE: 1997-05-23 

PRIOR APPLICATION NUMBER: US 08/847,910 

PRIOR FILING DATE: 1997-04-28 
; PRIOR APPLICATION NUMBER: US 08/795,071 
; PRIOR FILING DATE: 1997-02-05 

PRIOR APPLICATION NUMBER: US 08/525,596 
; PRIOR FILING DATE: 1995-10-25 
; PRIOR APPLICATION NUMBER: PCT/US 94/03019 



PRIOR FILING DATE: 1994-03-18 
PRIOR APPLICATION NUMBER: US 08/033,923 
PRIOR FILING DATE: 1993-03-19 
NUMBER OF SEQ ID NOS : 53 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 47 
LENGTH: 115 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-872-856-47 

Query Match 91.1%; Score 624; DB 11; Length 115; 

Best Local Similarity 93.3%; Pred. No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1, 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiiiii 'i 1 1: i ",i 'i i 1 1 1 1 ; i m i mi M h 1 1 iiiiiiii i 

Db 4 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFPT .fJPrPVT w 55 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

MMIMMIIIIMMI II 1 1 II 1 1 1 M I II 1 1 1 1 1 1 1 II I M I M I M 1 1 1 1 1 1 1 1 M 

Db 56 SLDTQYS KVLAL YNQHN PGASAAPCCVPQALE PLP I VY Y VGRKPKVEQLSNM I VRS CKCS 115 



RESULT 9 

US-10-335-483-29 

; Sequence 29, Application US/10335483 
; Publication No. US20030120058A1 
GENERAL INFORMATION: 

APPLICANT: Huynh, Thanh 
; Lee, Se-Jin 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-8 
; NUMBER OF SEQUENCES : 32 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 4225 Executive Square, Suite 1400 

CITY: La Jolla 

STATE : CA 

COUNTRY: US 

ZIP: 92037 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Windows 9 5 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/335 , 483 

FILING DATE: 31-Dec-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/177 , 860 

FILING DATE: <Unknown> 

APPLICATION NUMBER: 08/525,596 

FILING DATE: 19-SEP-1995 
; APPLICATION NUMBER: PCT/US94 / 07762 

FILING DATE: 08- JUL - 1 9 9 4 
ATTORNEY/AGENT INFORMATION: 



NAME: Wetherell, Jr., Ph.D, John R . 
REGISTRATION NUMBER: 31,678 
REFERENCE / DOCKET NUMBER : 07265/075001 
TELECOMMUNICATION INFORMATION : 
TELE PHONE : 619-678-5070 
TELEFAX: 619-678-5099 
INFORMATION FOR SEQ ID NO: 29: 
SEQUENCE CHARACTERISTICS : 

LENGTH: 115 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
FEATURE : 

NAME /KEY : Protein 
LOCATION: 1. .115 
SEQUENCE DESCRIPTION: SEQ ID NO : 29: 
US-10-335-483-29 

Query Match 91.1%; Score 624; DB 15; Length 115; 

Best Local Similarity 93.3%; Pred . No. 1.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIMMIMM 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 [ 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 

Db 4 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 55 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

IIMIIIMIIIIIIMIMMIIIMIMIMIMMIIIIIMIIIIIIIIMMIII 

Db 56 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 115 



RESULT 10 
US-10-002-278-8 

; Sequence 8, Application US/10002278 

; Publication No. US2 002 0132 334 Al 

; GENERAL INFORMATION: 

; APPLICANT: Jessell, Thomas M. 

; APPLICANT: Basler, Konard 

; APPLICANT: Yamada , Toshiya 

; TITLE OF INVENTION: CLONING, EXPRESSION AND USES OF DORSALIN-1 

; FILE REFERENCE: 0575/40314-A 

; CURRENT APPLICATION NUMBER: US/10/002,278 

; CURRENT FILING DATE: 2001-11-02 

; NUMBER OF SEQ ID NOS : 18 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 13 9 

TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: COOH- terminus of TGF-Beta 1 
NAME/ KEY : DOMAIN 
LOCATION: (1) . . (13 9) 
OTHER INFORMATION: 
US-10-002-278-8 



Query Match 91.1%; Score 624; DB 14; Length 139; 

Best Local Similarity 93.3%; Pred. No. 1.9e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 



1; 



Qy 


1 


Db 


28 


Qy 


61 


Db 


80 



MIIMIIII IIIIMIIIMIIIMIIIIIIIIMIIIIIIIIIIIIIII 

L.DTNYCFSST EKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 7 9 



IIIMIIIIIIIIMIIIIIIIIIIIMIIIIIIIIMIIMIIIIIIIIIIMIII 



RESULT 11 
US-09-214-592-25 

; Sequence 25, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTI VI TY- REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/ 0 9/2 14 , 5 92A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 2 5 

LENGTH: 315 

TYPE : PRT 

ORGANISM: bovine 
US-09-214-592-25 



Query Match 91.1%; Score 624; DB 11; Length 315; 

Best Local Similarity 93.3%; Pred. No. 4.5e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiiiii 1 1 1 1 1 , 1 1 i 1 1 1 1 , 1 . , 1 1 1 1 1 , , 1 1 ■ 1 1 1 ; . 1 1 ■ ; i 

Db 204 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 255 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

IIIIIIIIIMIIIIIIIIIIMIIMIIIIIIIIIIIMMIIIIMMIIIIIIIIII 

Db 256 SLDTQYSKVLAL YNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 12 
US-09-756-283A-23 

; Sequence 23, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Chernaj ovsky , Yuti 

; APPLICANT: Dreja, Hanna Stina 

; APPLICANT: Adams, Gillian 



; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 

/ CURRENT APPLICATION NUMBER: US/09/756 , 283A 
; CURRENT FILING DATE : 2001-01-09 
; NUMBER OF SEQ ID NOS : 10 0 

SOFTWARE: Patent In version 3.0 
; SEQ ID NO 23 

LENGTH: 3 90 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-756-283A-23 



Query Match 91.1%; Score 624; DB 10; Length 390; 

Best Local Similarity 93.3%; Pred. No. 5.6e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

1 1 1 1 1 1 1 1 1 1 1 N I I I I I I I I I I M I I M I I M ! I M I I I I I I I I I I I M I I 

Db 279 ALDTNYCFSST EKNCCVT^QLYIDFRKDLGWKWIHEPKGYH 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

IMIIIIMIMIIIIMIIIMIIMMIIIIIIIIIIMIMIMIIIIIIIIIIIII 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 13 
US-09-214-592-26 

; Sequence 26, Application US/09214592A 
; Publication No. US2003 00272 18A1 
; GENERAL INFORMATION: 

APPLICANT: Yamasaki , CMotoo 
; APPLICANT: Shibata , CKen j i 
; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER : US/ 0 9/2 1 4 , 5 92A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS: 34 
; SOFTWARE : 
; SEQ ID NO 26 

LENGTH: 3 90 

TYPE: PRT 

ORGANISM: porcine 
US-09-214-592-26 

Query Match 91.1%; Score 624; DB 11; Length 390: 

Best Local Similarity 93.3%; Pred. No. 5.6e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 60 

Mlllllllll lllllllllllllllllllllllllllllllllllllllll 
Db 279 ALDTNYCFSST EKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 33 0 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

in 1 1 ill 1 1 mm i mi ii 1 ' 1 1 "i ; 1 1 1 ';ri , 1 1 1; i ii i :i 1 1 , 1 1 1 



Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 14 
US-09-214-592-28 

Sequence 28, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki , CMotoo 
APPLICANT: Shibata , CKenj i 
APPLICANT: Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER : US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 28 
LENGTH: 390 
TYPE: PRT 
ORGANISM: canine 
US-09-214-592-28 

Query Match 91.1%; Score 624; DB 11; Length 390; 

Best Local Similarity 93.3%; Pred. No. 5.6e-57; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

milium mimmimmmimimimmmiii 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

ill ii i! 1 1 1 ill ill i, Mi, , ' 1 1 ' 1 'i i 1 1 1 1 ',, i,i i 1 1 1 1 1 1 1 ,,, i 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 15 
US-09-214-592-29 

; Sequence 29, Application US/09214592A 

; Publication No. US2 003 00272 18A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj l 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE : 11060 

; CURRENT APPLICATION NUMBER: US/09/2 14 , 5 92A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS: 34 
; SOFTWARE: 
; SEQ ID NO 2 9 

LENGTH: 3 90 

TYPE: PRT 



ORGANISM: ovine 
US-09-214-592-29 

Query Match 91.1%; Score 624; DB 11; Length 390; 

Best Local Similarity 93.3%; Pred . No. 5.6e-57 ; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

I II I II M I II IIIIIMIIIMIMIIIIMIIIIIIIIIIMIIIIIMI 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 II 1 1 1 1 1 1 M II 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



Search completed: October 28, 2003, 09:28:01 
Job time : 8.28061 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



October 28, 2003, 00:37:16 ; Search time 3.96533 Seconds 

(without alignments) 
2910.285 Million cell updates/sec 

US-10-017-372E-15 
685 

1 ALDTNYCFSSTDYKDDDDKE GRKPKVEQLSNMI VRSCKCS 12 0 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 
283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



PIR_76 : * 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



283308 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 
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ALIGNMENTS 



RESULT 
A40057 



1 



transforming growth factor beta-1 precursor - bovine (fragment) 

N; Alternate names: beta-TGF; cartilage-inducing factor-A; EGF-dependent TGF or 

dEGF; MGF-b; milk growth factor b; TGF-type II 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 16-Jul-1999 
C;Accession: A40057; A42320; A05284; A24322; B61439 

R;Van Obberghen-Schilling, E. ; Kondaiah, P.; Ludwig, R.L.; Sporn, M.B.; Baker, 
C . C . 

Mol. Endocrinol. 1, 693-698, 1987 

A; Title : Complementary deoxyribonucleic acid cloning of bovine transforming 
growth f actor-betal . 

A/Reference number: A40057; MUID: 91042552 ; PMID:3153459 
A /Access ion : A4 0057 
A; Molecule type: mRNA 
A;Residues: 1-315 <VAN> 

A; Cross-references: GB:M36271; NID -. gl63747 ; PIDN : AAA3 0778 . 1 ; PID:gl63748 
R;Ogawa, Y . ; Schmidt, D.K.; Dasch, J.R.; Chang, R.J.; Glaser, C.B. 
J. Biol. Chem. 267, 2325-2328, 1992 

A; Title: Purification and characterization of transforming growth f actor-beta2 . 3 

and -betal.2 heterodimers from bovine bone. 

A;Reference number: A42320; MUID: 92129307 ; PMID: 1733936 

A; Accession : A42 32 0 

A;Molecule type: protein 

A;Residues: 204-209, , X' ,211-217 <OGA> 

R;Roberts, A.B.; Anzano, M.A. ; Meyers, C.A. ; Wideman, J.; Blacher, R. ; Pan, 
Y.C.E.; Stein, S.; Lehrman, S.R.; Smith, J.M.; Lamb, L.C.; Sporn, M.B. 
Biochemistry 22, 5692-5698, 1983 

A;Title: Purification and properties of a type beta transforming growth factor 
from bovine kidney. 

A;Reference number: A05284; MUID : 84104793 ; PMID:6607069 
A; Accession : A05284 
A;Molecule type: protein 
A;Residues: 204-218 <R0B> 

R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H. ; Rosen, D.M.; McPherson, J.M.; Conti, 
A.; Siegel, N.R.; Galluppi, G.R.; Piez, K . A . 
J. Biol. Chem. 261, 5693-5695, 1986 

A;Title: Cart i lage - indue ing factor-A. Apparent identity to transforming growth 
factor-beta . 

A;Reference number: A24322; MUID : 861 95954 ; PMID:3754555 
A; Accession: A24322 
A;Molecule type: protein 
A;Residues: 204-233 <SEY> 

R;Jin, Y.; Cox, D.A.; Knecht , R . ; Raschdorf, F. ; Cerletti, N. 
J. Protein Chem. 10, 565-575, 1991 

A;Title: Separation, purification, and sequence identification of TGF-betal and 
TGF-beta2 from bovine milk. 

A;Reference number: A61439; MUID : 92 189724 ; PMID: 1799413 
A; Accession: B6143 9 
A;Molecule type: protein 

A; Residues : 2 04 -209,' X', 211-217,' XX', 22 0-232 <JIN> 

C; Comment: This polypeptide is composed of two polypeptide chains cross-linked 
by disulfide bonds. It has been found in neoplastic and non-neoplastic tissues. 
C; Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic 
mitogenic activity, but in soft agar, it reacts synergistically with either type 
I TFG or EGF, and induces cell proliferation. Cells grown in monolayer do not 
respond in a similar manner to these growth factors, but morphologically do 
acquire a transformed phenotype. 



C; Superfamily : inhibin 

C; Keywords: glycoprotein; growth factor; heterodimer 

F;204-315/Product : transforming growth factor beta-1 #status experimental <MAT> 
F ; 7 , 61 , 101/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 91.1%; Score 624; DB 2; Length 315; 

Best Local Similarity 93.3%; Pred . No. 5.9e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 60 

MIMIIIMI i 1 E 1 1 1 ( 1 1 1 1 ! 1 1 1 1 1 ! I ( 1 1 1 1 1 i 1 1 ! 1 1 1 ( 1 1 1 1 1 1 ( 1 

Db 204 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 255 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

IIIIIIIIMIIIIIIMIMIIIMIIIIMIIIIIIIIIIIIIMIIIIIIIIIIMI 

Db 2 56 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 2 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N; Alternate names: growth- inhibitory factor; TGF type 2; TGF-beta 
C; Species: Homo sapiens (man) 

C;Date: 28-Feb-1986 #sequence_revision 19-Oct-1995 #text_change 08-Dec-2000 
C;Accession: A27513; A01395; A22290; 159664; S53444 
R;Derynck, R. ; Rhee, L. ; Chen, E.Y.; Van Til burg, A. 
Nucleic Acids Res. 15, 3188-3189, 1987 

A; Title: Intron-exon structure of the human transforming growth factor-beta 
precursor gene. 

A;Reference number: A27513; MUID: 87174845 ; PMID:3470709 
A; Access ion: A27513 
A; Molecule type: DNA 
A;ReSldueS: 1-390 <DER> 

A; CrOSS -references : GB:X05839; GB:Y00112; NID:g37097; PIDN : CAA29283 . 1 ; 
PID:gl212989 

R;Derynck, R . ; Jarrett, J. A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian, 
R.K.; Roberts, A.B.; Sporn, M.B.; Goeddel , D.V. 
Nature 316, 701-705, 1985 

A; Title: Human transforming growth factor-beta complementary DNA sequence and 
expression in normal and transformed cells. 
A;Reference number: A01395; MUID : 85296301 ; PMID:3861940 
A; Access ion : AO 13 95 
A; Molecule type: mRNA 

A; Residues: 1-9, 'P' ,11-24, ' P» , 26-159, 'R' , 160-3 90 <DE2> 

A; Cross -references : GB:X02812; GB:J05114; NID:g37092; PIDN : CAA26580 . 1 ; 
PID:g37093 

A; Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino-terminal signal peptide 

R;Massague, J.; Like, B. 

J . Biol. Chem. 260, 2636-2645, 1985 

A;Title: Cellular receptors for type beta transforming growth factor. Ligand 

binding and affinity labeling in human and rodent cell lines. 

A;Reference number: A22290; MUID : 8513 1 019 ; PMID:2982829 

A; Access ion : A22290 

A; Molecule type: protein 

A;Residues: 279-295 ,' XX 298 - 301 <MAS> 



R;Urushizaki, Y. ; Niitsu, Y.; Terui, T. ; Koshida, Y. ; Mahara, K. ; Kohgo, Y. ; 
Urushizaki, I.; Takahashi, Y . ; I to, H. 
Tumor Res. 22, 41-55, 1987 

A;Title: Cloning and expression of the gene for human transforming growth 
factor-beta in Escherichia coli. 
A; Reference number: 159664 
A;Accession: 159664 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 279-390 <RES> 

A;Cross-references: GB:M38449; NID : g33 9557 ; PIDN : AAA3 673 5 . 1 ; PID:g339558 
R;Stam, K. ; Stewart, A. A.; Qu, G.Y.; Iwata, K.K.; Fenyoe, D.; Chait, B.T.; 
Marshak, D.R.; Haley, J.D. 
Biochem. J. 305, 87-92, 1995 

A,-Title: Physical and biological characterization of a growth- inhibitory 

activity purified from the neuroepithelioma cell line A673 . 

A; Reference number: S53444; MUID : 95126934 ; PMID:7826358 

A;AccesSlon: S53444 

A; Status : preliminary 

A; Molecule type: protein 

A;Residues: 279-297 <STA> 

C;Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C;Genetics : 

A; Gene: GDB : TGFB1 ; TGFB 

A; Cross -references : GDB:120729; OMIM:190180 
A ; Map position: 19ql3 . 2 - 19ql3 . 2 
C; Superf amily : inhibin 

C;Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation 
F; 1-18/Domain: signal sequence #status predicted <SIG> 
F; 19 -2 78 /Domain : propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/ Product : transforming growth factor beta-1 #status experimental <MAT> 
F;82, 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 91.1%; Score 624; DB 1; Length 390; 

Best Local Similarity 93.3%; Pred. No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 60 

1 1 1 1 II 1 1 1 1 1 IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 3 
JC4 023 

transforming growth factor beta-1 - dog 
C;Species: Canis lupus familiaris (dog) 

C;Date: 13-Jun-1995 #sequence_revis ion 14-Jul-1995 #text_change 24-Nov-1999 
C; Access ion: JC4 023 

R;Manning, A.M.; Auchampach, J. A.; Drong, R.F.; Slightom, J.L. 
Gene 155, 307-308, 1995 



A; Title: Cloning of a canine cDNA homologous to the human transforming growth 
factor-beta 1-encoding gene. 

A ; Reference number: JC4023; MUID: 95237630 ; PMID:7721110 
A; Accession : JC4 023 
A; Molecule type: mRNA 
A;Residues: 1-390 <MAN> 

A;Cross-ref erences : GB:L34956; NID:g516071; PIDN : AAA514 58 . 1 ; PID:g516072 

C; Comment: This factor plays a multifunctional role as a regulator of mammalian 

cell growth and as a modulator of immune responses. 

C ; Genet ics : 

A;Gene: tgf-betal 

C; Super family: inhibin 

C; Keywords: growth factor; transforming protein 

F;288-390/Product : transforming growth factor beta 1 #status predicted <MAT> 

Query Match 91.1%; Score 624; DB 2; Length 390; 

Best Local Similarity 93.3%; Pred . No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0: Indels A • rs^ps i ; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIMIIIIM II I II 1 1 M II I M 1 1 1 1! M I M M M I II 1 1 M M I II I 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 1 1 1 n I II 1 1 1 1 N 'I I III Mi M 1 1 1 1 1 1 1 ! 1 1 '1 1 1 II M '1 1 1 1 1 1 1 1 , l , I 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 4 
A26960 

transforming growth factor beta-1 precursor - green monkey 
C; Species: Cercopithecus aethiops (green monkey, grivet) 

C;Date: 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change 24-Nov-1999 
C; Access ion: A26960 

R;Sharples, K. ; Plowman, G.D.; Rose, T.M.; Twardzik, D.R.; Purchio, A . F . 
DNA 6, 239-244, 1987 

A;Title: Cloning and sequence analysis of simian transforming growth factor-beta 
CDNA. 

A; Reference number: A26960; MUID : 87246074 ; PMID:3474130 
A; Access ion : A26 960 
A /Molecule type: mRNA 
A;Residues: 1-390 <SHA> 

A; Cross-references : GB:M16658; NID : gl76552; PIDN : AAA35369 . 1 ; PID:gl76553 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F ; 1-16/ Doma in : signal sequence #status predicted <SIG> 

F; 17-390/ Product : transforming growth factor beta #status predicted <MAT> 

Query Match 91.1%; Score 624; DB 2; Length 390; 

Best Local Similarity 93.3%; Pred. No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 60 

iiiiiiiiiii 1 1 1 1 1 1 1 > 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 i 1 1 f i 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 



Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 



IIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIII 

Db 331 SLDTQYSKVXALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 5 
A27512 

transforming growth factor beta-1 precursor - pig 
N; Alternate names: TGF-beta 

C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 05-Jun-1988 #sequence_revis ion 05-Jun-1988 #text_change 16-Jul-1999 

C;AcceSSion: A27512; A26356; 146657 

R;Derynck, R . ; Rhee, L . 

Nucleic Acids Res. 15, 3187, 1987 

A; Title: Sequence of the porcine transforming growth factor-beta precursor. 
A;Reference number: A27512; MUID : 87174844 ; PMID:3470708 
A;Accession: A27512 
A /Molecule type: mRNA 
A; Residues: 1-390 <DER> 

R;Cheifetz, S.; Weatherbee, J. A. ; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; 
Lucas, R. ; Massague, J. 
Cell 48, 409-415, 1987 

A;Title: The transforming growth factor-beta system, a complex pattern of cross- 
reactive ligands and receptors. 

A;Reference number: A90890; MUID : 87102890 ; PMID:2879635 
A ,-Access ion: A26356 
A;Molecule type: protein 
A;Residues: 279-322 <CHE> 

R;Kondaiah, P.; Van Obberghen-Schill mg, E. ; Ludwig, R.L. ; Dhar, R . ; Sporn, 
M.B. ; Roberts, A . B . 

J. Biol. Chem. 263, 18313-18317, 1988 

A; Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 
Evidence for alternate splicing and polyadenylation . 
A/Reference number: 146657; MUID: 89054010; PMID: 2461367 
A; Accession: 14 6657 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <KON> 

A; Cross -references : GB:M23703; NID:g755044; PIDN : AAA646 16 . 1 ; PID:g755045 
C;Genetics : 

A; Gene: TGFB; TGF-beta -1 
C; Super family: inhibm 
C; Keywords: growth factor 

Query Match 91.1%; Score 624; DB 2; Length 390; 

Best Local Similarity 93.3%; Pred . No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIMIIIMI IMMMIIMIIIIIIIMIIIIIIIIIIIIIIMIIIM 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIII 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 6 



146463 

transforming growth factor beta-1 - sheep 

C;Species: Ovis orientalis aries, Ovis ammon aries (domestic sheep) 

C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text_change 24-Nov-1999 

C;Accession: 146463; S45115 

R,-Woodall, C.J.; McLaren, L.J.; Watt, N.J. 

Gene 150, 371-373, 1994 

A; Title: Sequence and chromosomal localisation of the gene encoding ovine latent 
transforming growth factor-beta 1. 

A;Reference number: 146463; MUID : 9512 1932 ; PMID:7821809 
A; Accession : 14 64 63 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-390 <W00> 

A; Cross-references : EMBL:X76916; NID:g496648; PIDN : CAA54242 . 1 ; PID:g496649 
A;Note: submitted to the EMBL Data Library, December 1993 
C; Superf amily : inhibm 

Query Match 91.1%; Score 624; DB 2; Length 3 90; 

Best Local Similarity 93.3%; Pred. No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLY I DFRKDLGWKW I HEPKGYHANFCLGPCPY IW 6 0 

Illllllllll 1 1 1 1 1 ! Ml 1 1 1 1 , 'I ! I III i I :, I II I II 1 , 1 1 ! 1 1 1 1! 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQ YS KVLALYNQHN PGASAAPCCVPQALE PLP I VY YVGRKP KVEQLSNM I VRS CKCS 12 0 

M 1 1 1 1 1 1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 ! 1 1 1 M 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRS CKCS 390 



RESULT 7 
S01413 

transforming growth factor beta-1 precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C; Access ion : SO 14 13 

R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B. 
Nucleic Acids Res. 16, 8730, 1988 

A; Title: Nucleotide sequence of chicken transforming growth factor-beta 1 (TGF- 
beta 1) . 

A;Reference number: S01413; MUID : 88335639 ; PMID:3166520 
A; Access ion: SO 14 13 
A; Molecule type: DNA 
A; Residues: 1-3 91 <JAK> 

A; Cross-references : EMBL:X12373; NID:g63808; PIDN : CAA30933 . 1 ; PID:g63809 
C; Superf amily : inhibin 
C; Keywords: growth factor 

Query Match 91.1%; Score 624; DB 2; Length 391; 

Best Local Similarity 93.3%; Pred. No. 7.4e-58; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPY I W 60 

Illllllllll I I I I I I I II I I II I I II I I I I I I I I I I II I II I I I I I I I I I 

Db 280 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 331 



Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

M III I MINI II III MINIMI Ml MINI IIMIM III INN Mil MINI 

Db 332 SLDTQYS KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 391 



RESULT 8 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N; Alternate names: TGF type 2; TGF-beta 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1986 #sequence_revision 04-Dec-1986 #text_change 24-Nov-1999 
C;Accession: A01396 

R;Derynck, R . ; Jarrett, J. A. ; Chen, E.Y.; Goeddel , D.V. 
J , Biol. Chem. 261, 4377-4379, 1986 

A; Title: The murine transforming growth factor-beta precursor. 
A;Reference number: A01396; MUID : 86168129 ; PMID:3007454 
A; Access ion: AO 13 96 
A; Molecule type: mRNA 
A;Residues: 1-390 <DER> 

A; Cross-references : GB:M13177; NID:g201952; PIDN : AAA4 0423 . 1 ; PID:g201953 
A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 
core of an amino- terminal signal peptide 

C;Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Superf amily : inhibin 

C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen; 
t rans f orma t i on 

F ; 1 -23 /Domain : signal sequence #status predicted <SIG> 
F; 24 -27 8 /Domain : propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/ Product : transforming growth factor beta-1 #status predicted <MAT> 
F; 82 , 136 , 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 90.7%; Score 621; DB 1; Length 390; 

Best Local Similarity 92.5%; Pred . No. 1.5e-57; 

Matches 111; Conservative 1; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MMMMMI 1 1 M 1 1 1 II M M II M M M M M 1 1 1 1 1 II I M I M M I 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

I I I I I I I I I II I I I I I I I I I I I : I I I I I I I I I I I I I I 1 I I I I ! 1 I I I I I I I I I I I I I I I I 
Db 331 SLDTQYSKVLALYNQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 9 
S10219 

transforming growth factor beta-1 precursor - rat 
N; Alternate names: TGF type 2; TGF-beta 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 24-Nov-1999 
C; Accession: S10219; PT0023; S02267 

R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Nucleic Acids Res. 18, 3059, 1990 

A;Title: cDNA cloning by PCR of rat transforming growth factor beta-1. 



A; Reference number: S10219; MUID : 90272425 ; PMID:2349108 
A; Accession : SI 02 19 
A; Molecule type: mRNA 
A/Residues : 1-390 <QIA> 

A; Cross -references: EMBL:X52498; NID:g57341; PIDN : CAA3674 1 . 1 ; PID:g57342 
R;Okada, F. ; Yamaguchi, K. ; Ichihara, A.; Nakamura , T. 
J. Biochem. 106, 304-310, 1989 

A;Title: Purification and structural analysis of a latent form of transforming 
growth factor-beta from rat platelets. 

A /Reference number: PT0023; MUID : 90036779; PMID:2478527 
A; Accession : PT0023 
A;Molecule type: protein 

A; Residues: 30-32, 'X 1 ,34-38, 'Q* , 40-42, 'X 1 ,44 <0KA> 
R;Okada, F. ; Yamaguchi, K. ; Ichihara, A.; Nakamura, T. 
FEBS Lett. 242, 240-244, 1989 

A;Title: One of two subunits of masking protein in latent TGF-beta is a part of 
pro-TGF-beta . 

A; Reference number: S02267; MUID: 89121078 : PMID : 291 4finq 
A /Accession: S022 67 
A;Molecule type: protein 

A;Residues: 30-32, 'X* ,34-38, *Q' ,40-42, 'X' ,44 <OK2> 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; integrin binding 
F; 1-29/Domain: signal sequence #status predicted <SIG> 
F;30-278/Domain: propeptide #status experimental <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F ; 82 , 136 , 17 6 /Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 90.7%; Score 621; DB 2; Length 3 90; 

Best Local Similarity 92.5%; Pred . No. 1.5e-57; 

Matches 111; Conservative 1; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIMIIIIII IIIIMIIMIIMMIIIIIIIIIIIIIMIIIIIIIMI 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

lllllllllllllllllllllhllllllllllMIIIIIIIIIIIIIMIIIIIIIIII 
Db 331 SLDTQ YS KVLALYNQHNPGAS AS PCCVPQALEPLP I VYYVGRKP KVEQLSNM I VRS CKCS 390 



RESULT 10 
148196 

transforming growth factor beta-1 precursor - golden hamster (fragment) 
C; Species: Mesocricetus auratus (golden hamster) 

C;Date: 02-Jul-1996 #sequence_revision 04-Oct-1996 #text_change 24-Nov-1999 
C; Access ion: 14 8196 

R;Wong, D.T.; Donoff, R.B.; Yang, J.; Song, B.Z.; Matossian, K. ; Nagura, N . ; 
Elovic, A.; McBride, J.; Gallagher, G.; Todd, R.; Chiang, T. ; Chou, L.S.S.; 
Yung, CM.; Galli, S.J.; Weller, P.F. 
Am. J. Pathol. 143, 130-142, 1993 

A;Title: Sequential expression of transforming growth factors alpha and beta 1 
by eosinophils during cutaneous wound healing in the hamster. 
A; Reference number: 148196; MUID : 933 044 79 ; PMID:8317544 
A; Access ion: 148196 

A; Status: preliminary; translated from GB/EMBL/DDBJ 



A; Molecule type: mRNA 
A;Residues: 1-130 <RES> 

A; Cross -references: EMBL:X60296; NID : g3 96177 ; PIDN:CAA42838 . 1; PID:g396178 
C; Superfamily : inhibin 

Query Match 88.9%; Score 609; DB 2; Length 130; 

Best Local Similarity 91.7%; Pred. No. 8.7e-57; 

Matches 110; Conservative 0; Mismatches 2; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 6 0 

M l M I ll M I i ii m M 1 1 1 ii i , : 1 1 ! 1 1 ii, 1 m 1 : 1 1 ii , 1 1 1 ! i 1 1 

Db 19 ALDTNYCFSST EKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 7 0 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

I ; I M, I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 II I M I Ii , 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 Ml 

Db 71 SLDTQYSKVLALYNQHNPGASAGPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSYKCS 130 



RESULT 11 
A41918 

transforming growth factor beta -4 precursor - chicken (fragment) 
N; Alternate names: TGF-beta 4 
C;Species: Gallus gallus (chicken) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R;Burt, D.W.; Jakowlew, S.B. 

Mol . Endocrinol. 6, 989-992, 1992 

A;Title: Correction: a new interpretation of a chicken transforming growth 
factor-beta 4 complementary DNA. 

A; Reference number: A41918; MUID : 92357039 ; PMID:1353860 
A; Access ion-. A41918 
A; Molecule type: mRNA 
A;Residues: 1-373 <BUR> 

A; Cross -references: GB:M31160; GB:X08012; GB:S41706; NID : gl262437 ; 
PIDN:AAB05637.1; PID:gl262438 

A;Note: sequence extracted from NCBI backbone (NCBIN : 110186, NCBI P : 110187) 
A;Note: this report corrects and reinterprets the sequence from reference A34 941 
R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 2, 1186-1195, 1988 

A; Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic 
acid encoding transforming growth factor beta-4 from chicken embryo 
chondrocytes . 

A;Reference number: A34941; MUID: 89112198 ; PMID:2464131 
A; Access ion : A3 4 941 
A;Molecule type: mRNA 

A; Residues: ' MDPMSIGPKSCGGSPWRPPGTAPWSIGSRRATASSSCSTSSRVRAEVGGRAL 1 , 122- 
209, ' D 1 ,211-373 <JAK> 
A;Cross-ref erences : EMBL:X08012 

A;Note: this sequence has been corrected in A41918 
C; Superfamily: inhibin 

C; Keywords: glycoprotein; growth factor 

F;l/Domain: signal sequence (fragment) #status predicted <SIG> 
F;223-225/Region.- cell attachment (R-G-D) motif 

F;260-373/Product : transforming growth factor beta-4 #status predicted <MAT> 
F;54 , 109, 153/Bmding site: carbohydrate (Asn) (covalent) #status predicted 



Query Match 



78.8%; Score 540; DB 2; Length 373; 



Best Local Similarity 79.2%; Pred. No. 4.7e-49; 

Matches 95; Conservative 8; Mismatches 9; Indels 8; Gaps 2; 



Qy 2 LDTN Y CFS - STDY KDDDDKEKN CCVRQL Y I DFR KDLGWKW I H E PKG YHAN F CLG P C P Y I W 60 

Ilhlll II lllllll lllllllll MINIMI! MlhlMIMI 

Db 261 LDTDYCFGPGTD EKNCCVRPLYI DFRKDLQWKWIHEPKGYMANFCMGPCPYIW 313 

Qy 61 SLDTQYSKVTaALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

I 1 1 1 h 1 1 1 II 1 1 1 II I M M M 1 1 1 1 1 h I II h M 1 1 1 = 1 1 1 1 1 1 h I h 1 1 1 1 

Db 314 SADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPI I YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
B61036 

transforming growth factor beta-5 precursor - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 31-Dec-1993 #sequence_revision 03-Feb-1994 #text_change 16-Jul-1999 
C;Accession: A34929; B61036 

R;Kondaiah, P.; Sands, M.J.; Smith, J.M.; Fields, A.; Roberts, A.B.; Sporn, 
M.B. ; Melton, D . A . 

J. Biol. Chem. 265, 1089-1093, 1990 

A; Title-. Identification of a novel transforming growth factor-beta (TGF-beta5) 
mRNA in Xenopus laevis. 

A;Reference number: A34929; MUID : 90110090 ; PMID: 2295601 
A; Accession: A34 92 9 
A; Molecule type: mRNA 
A ; Residues: 1-382 <KON> 

A; Cross-references : GB:J05180; NID:g214821; PIDN: AAA49 968 . 1 ; PID:g214822 
R;Roberts, A.B.; Rosa, F.; Roche, N.S.; Coligan, J.E.; Garfield, M. ; Rebbert , 
M.L.; Kondaiah, P.; Danielpour, D. ; Kehrl , J.H.; Wahl, S.M.; Dawid, I.B.; Sporn, 
M.B. 

Growth Factors 2, 135-147, 1990 

A;Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium 
conditioned by Xenopus XTC cells. 

A ; Reference number: A61036; MUID: 90253806 ; PMID.-2340184 
A; Access ion : B61036 
A;Molecule type: protein 

A;Residues: 271-276, 'X' ,278-284, 'XX' ,287-299 <ROB> 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F; 271-382/ Product : transforming growth factor beta-5 tfstatus experimental <MAT> 

Query Match 71.7%; Score 4 91; DB 2; Length 3 82; 

Best Local Similarity 73.0%; Pred. No. 6.8e-44; 

Matches 84; Conservative 10; Mismatches 13; Indels 8; Gaps 1; 
Qy 6 YCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQ 65 

Ml = MM: M I : I M M I ! I M M ! ! M 11-111 IMMhM! 

Db 276 YCFGNNG PNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQ 327 

Qy 66 YSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

IIMhIllhlllll MIMI 1 1 1 1 1 ! : 1 1 1 1 1 lllllllhllll II 

Db 328 YSKVLSLYNQNNPGAS I SPCCVPDVLEPLP 1 1 YYVGRTAKVEQLSNMWRSCNCS 382 



RESULT 13 
A41397 



transforming growth factor beta -3 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 03-Apr-1992 #sequence_revis ion 03-Apr-1992 #text_change 16-Jul-1999 
C;Accession: A41397; A61039; A61225 

R;Miller, D.A. ; Lee, A.; Matsui, Y . ; Chen, E.Y.; Moses, H.L.; Derynck, R . 
Mol. Endocrinol. 3, 1926-1934, 1989 

A; Title: Complementary DNA cloning of the murine transforming growth factor- 

beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and 

TGFbetal messenger RNA in murine embryos and adult tissues. 

A; Reference number: A41397; MUID : 90190650 ; PMID:2628730 

A ,-Accession: A413 97 

A /Molecule type: mRNA 

A;Residues: 1-410 <MIL> 

A; Cross-references : GB:M32745; NID ; g201949 ; PIDN : AAA40422 . 1; PID:g201950 
R/Denhez, F . / Lafyatis, R. ; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Growth Factors 3, 139-146, 1990 

A; Title: Cloning by polymerase chain reaction of a new mouse TGF-beta, mTGF- 
beta3 . 

A/Reference number: A61039; MUID : 91000714 ; PMID:2206556 
A/Accession: A61039 
A; Molecule type: mRNA 
A;ResldueS: 1-410 <DEN> 

R;Watrin, F. ; Scotto, L. ; Assoian, R.K.; Wolgemuth, D.J. 
Cell Growth Differ. 2, 77-83, 1991 

A;Title: Cell lineage specificity of expression of the murine transforming 

growth factor beta-3 and transforming growth factor beta-1 genes. 

A/Reference number: A61225; MUID : 91299576 ; PMID:2069871 

A; Accession: A61225 

A; Status: translation not shown 

A /Molecule type: mRNA 

A/Residues : 285-410 <WAT> 

C; Superf amily : inhibin 

C; Keywords: glycoprotein/ growth factor/ growth regulation 
F; 1 -21 /Domain : signal sequence #status predicted <SIG> 
F/22-298/Domain: propeptide #status predicted <PRO> 
F/259-261/Region: cell attachment (R-G-D) motif 

F/299-410/Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 72, 133, 140/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 70.9%/ Score 4 86/ DB 2; Length 410/ 

Best Local Similarity 71.7%; Pred . No. 2.5e-43; 

Matches 86/ Conservative 11; Mismatches 15; Indels 8/ Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

MINIM : M M 1 1 1 I ! ! M ! = 1 1 1 M I : I M 1 1 1 : ! 1 1 1 MMM 

Db 299 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 350 

Qy 61 SLDTQYSKVTALYNQHNPGASAAPCCVPOALEPLPI VYYVGRKPKVRQLSNMI VESCKCS 120 

I II • I II III II llhllllll MM MIIMI IIIIIIIIMhIMII 

Db 351 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



RESULT 14 
A36169 

transforming growth factor beta-3 precursor - human 
C; Species: Homo sapiens (man) 

C/Date: 14-Dec-1990 #sequence_revision 14-Dec-1990 #text_change 20-Jun-2000 



C/Accession: A36169; A41262; S01824 

R;ten Dijke, P.; Hansen, P . ; Iwata, K.K.; Pieler, C. ; Foulkes, J.G. 
Proc. Natl. Acad. Sci. U.S.A. 85, 4715-4719, 1988 

A;Title: Identification of another member of the transforming growth factor type 
beta gene family. 

A /Reference number: A36169; MUID : 88263019 ; PMID:3164476 
A /Access ion : A3 616 9 

A; Status : preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A/Residues : 1-412 <TEN> 

A; Cross-references: GB:J03241; NID:g339551; PIDN : AAA61161 . 1 ; PID : g339552 
R;Arrick, B.A.; Lee, A.L.; Grendell, R.L.; Derynck, R. 
Mol. Cell. Biol. 11, 4306-4313, 1991 

A; Title: Inhibition of translation of transforming growth factor-beta3 mRNA by 
its 5' untranslated region. 

A; Reference number: A41262; MUID : 91342629; PMID:1875922 

A; Accession : A41262 

A; Molecule type: DNA 

A; Residues: 1-48 <ARR> 

A; Cross-references : GB : M58524 

R;Derynck, R . ; Lindquist, P.B.; Lee, A.; Wen, D. ; Tamm, J.; Graycar, J.L.; Rhee, 
L . ; Mason, A.J.; Miller, D.A.; Coffey, R.J.; Moses, H.L.; Chen, E.Y. 
EMBO J. 7, 3737-3743, 1988 

A;Title: A new type of transforming growth factor-beta, TGF-beta3. 
A;Reference number: S01824; MUID: 89091120; PMID: 3208746 
A; Access ion: SO 1824 
A; Molecule type: mRNA 
A; Residues: 3-412 <DER> 

A; Cross-references : EMBL:X14885; NID:g37075; PIDN : CAA33 024 . 1 ; PID:gl200236 

C;Genetics : 

A; Gene: GDB : TGFB3 

A; Cross -references : GDB: 120437; OMIM: 190230 
A ; Map position: 14q24-14q24 

A;Introns: 118/1; 172/3; 216/1; 252/1; 309/2; 360/3 

C; Superfamily : inhibin 

C; Keywords: growth factor; homodimer 

F ; 1-27/Domain signal sequence #status predicted <SIG> 
F;28-300/Domain: propeptide tfstatus predicted <PRO> 

F; 3 01 -4 12 /Product : transforming growth factor beta-3 #status predicted <MAT> 

Query Match 70.9%; Score 486; DB 2; Length 412; 

Best Local Similarity 71.7%; Pred. No. 2.5e-43; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIM : hlllll lllllhlllllhlllllhllll Mllh 

Db 301 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 352 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 12 0 

I II M 1 1 Ml 1 1 II h I II M I MM hlllll M I II I II h h II I II 

Db 353 S ADTTHST VLGLYNTLN P EAS AS PCC VPQDLE PLT I LYYVGRTPKVEQLSNMWKS CKCS 412 



RESULT 15 
A55706 

transforming growth factor beta-3 precursor - rat 
C; Species-. Rattus norvegicus (Norway rat) 



C;Date: 03-Mar-1995 #sequence_revision 03-Mar-1995 #text__change 16-Jul-1999 
C;Accession: A55706; B40699; S36042 

R;Wang, J.; Kuliszewski, M . ; Yee, W. / Sedlackova, L. ; Xu, J.; Tseu, I.; Post, M. 
J. Biol. Chem. 270, 2722-2728, 1995 

A;Title: Cloning and expression of glucocort icoid- induced genes in fetal rat 

lung fibroblasts. Transforming growth factor-beta -3 . 

A;Reference number: A55706; MU1D: 95155340 ; PMID:7852342 

A; Accession : A55706 

A; Molecule type: mRNA 

A;ResidueS: 1-410 <WAN> 

A; Cross-references : GB:U03491 

A;Note: it is uncertain whether Met-1 is the initiator 
R;McKinnon, R.D. ; Piras, G. ; Ida Jr., J. A.; Dubois -Dalcq, M. 
J. Cell Biol. 121, 1397-1407, 1993 

A;Title: A role for TGF-beta in oligodendrocyte differentiation. 

A;Reference number: A40699; MUID : 93286190 ; PMID: 8509457 

A /Access ion: B4 069 9 

A; Status: preliminary 

A; Molecule type: mRNA 

A;Residues: 157-211 <MCK> 

A; Cross-references : EMBL-.X71903; NID:g311326; PIDN : CAA50722 . 1 ; PID-.g311327 
C; Super family : inhibin 

Query Match 70.8%; Score 485; DB 2; Length 410; 

Best Local Similarity 71.7%; Pred. No. 3.1e-43; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIIIMI = hlllll 1 I ■ I ' ' ' I I ' I , llllh 

Db 299 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 350 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 



Db 



351 SSDTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



Search completed: October 28, 2003, 09:09:48 
Job time : 3.96533 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 28, 2003, 00:08:41 



; Search time 2.4 052 Seconds 

(without alignments) 

2346.251 Million rpll updates /sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-372E-15 
685 

1 ALDTNYCFSSTDYKDDDDKE 



GRKPKVEQLSNMI VRSCKCS 12 0 



Scoring table: BLOSUM62 

Gapop 10.0 



Gapext 0 . 5 



Searched : 



127863 seqs, 47026705 residues 



Total number of hits satisfying chosen parameters: 127863 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt 41:* 



Pred . No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


624 


91 


. 1 


315 


1 


TGF1_B0VIN 


P18341 


bos taurus 


2 


624 


91 


. 1 


390 


1 


TGF1_CANFA 


P54831 


cams famil 


3 


624 


91 


. 1 


390 


1 


TGF1_CERAE 


P09533 


cercopithec 


4 


624 


91 


. 1 


390 


1 


TGF1_HUMAN 


P01137 


homo sapien 


5 


624 


91 


. 1 


390 


1 


TGF1 PIG 


P07200 


sus scrofa 


6 


624 


91 


. 1 


390 


1 


TGF1_SHEEP 


P50414 


ovis aries 


7 


621 


90 


. 7 


390 


1 


TGF1_M0USE 


P04202 


mus musculu 


8 


621 


90 


.7 


390 


1 


TGF1_RAT 


P17246 


rattus norv 


9 


620 


90 


. 5 


390 


1 


TGF1_H0RSE 


019011 


equus cabal 


10 


612 


89 


.3 


390 


1 


TGF1 CAVPO 


Q9zly6 


cavia porce 


11 


540 


78 


. 8 


373 


1 


TGF1 CHICK 


P09531 


gallus gall 


12 


491 


71 


. 7 


382 


1 


TGF1_XENLA 


P16176 


xenopus lae 


13 


486 


70. 


. 9 


410 


1 


TGF3JV10USE 


P17125 


mus musculu 


14 


486 


70. 


. 9 


412 


1 


TGF3_HUMAN 


P10600 


homo sapien 


15 


485 


70 


.8 


412 


1 


TGF3 RAT 


Q07258 


rattus norv 


16 


479 


69. 


, 9 


409 


1 


TGF3_PIG 


P15203 


sus scrofa 


17 


475 


69 


.3 


412 


1 


TGF2_CHICK 


P30371 


gallus gall 


18 


468 


68 


.3 


112 


1 


TGF2__B0VIN 


P21214 


bos taurus 


19 


468 


68 , 


. 3 


414 


1 


TGF2__HUMAN 


P08112 


homo sapien 


20 


468 


68 


.3 


435 


1 


TGF2_PIG 


P09858 


sus scrofa 


21 


466 


68 


. 0 


414 


1 


TGF2_MOUSE 


P27090 


mus musculu 


22 


466 


68 . 


. 0 


442 


1 


TGF2_RAT 


Q07257 


rattus norv 


23 


464 


67 


.7 


412 


1 


TGF3 CHICK 


P16047 


gallus gall 


24 


464 


67 


.7 


413 


1 


TGF2_XENLA 


P17247 


xenopus lae 


25 


209 


30. 


. 5 


425 


1 


IHBA_SHEEP 


P43032 


ovis aries 


26 


209 


30. 


. 5 


426 


1 


IHBA^HORSE 


P55102 


equus cabal 


27 


205 


29 


.9 


424 


1 


IHBA MOUSE 


Q0499R 


mus fltu sculu 


28 


205 


29. 


, 9 


424 


1 


IHBA__PIG 


P03970 


sus scrofa 


29 


205 


29. 


. 9 


424 


1 


IHBA_RAT 


P18331 


rattus norv 


30 


205 


29. 


. 9 


425 


1 


IHBA_BOVIN 


P07995 


bos taurus 


31 


205 


29. 


9 


426 


1 


IHBA_HUMAN 


P08476 


homo sapien 


32 


202 . 5 


29. 


. 6 


374 


1 


GDF8_BRARE 


042222 


brachydanio 


33 


196.5 


28. 


.7 


375 


1 


GDF8_PAPHA 


018828 


papio hamad 


34 


196 


28 . 


, 6 


424 


1 


IHBA__CHICK 


P27092 


gallus gall 


35 


193.5 


28 . 


2 


375 


1 


GDF8 SHEEP 


018830 


ovis aries 


36 


192 . 5 


28 . 


1 


375 


1 


GDF8 CHICK 


042220 


gallus gall 



37 


192. 


5 


28 . 


, 1 


375 


1 


GDF8_HUMAN 


014793 


homo sap i en 


38 


192 . 


. 5 


28 . 


. 1 


375 


1 


GDF8_MELGA 


042221 


meleagris g 


39 


192 . 


. 5 


28 . 


, 1 


375 


1 


GDF8_PIG 


018831 


sus scrofa 


40 


192 . 


, 5 


28 . 


. 1 


376 


1 


GDF8_MOUSE 


008689 


mus musculu 


41 


192. 


5 


28 . 


, 1 


376 


1 


GDF8_RAT 


035312 


rattus norv 


42 


190. 


5 


27. 


.8 


405 


1 


GDFB MOUSE 


Q9zlw4 


mus musculu 


43 


190. 


. 5 


27. 


, 8 


407 


1 


GDFB HUMAN 


095390 


homo sapien 


44 


187 . 


. 5 


27. 


. 4 


375 


1 


GDF8_BOVIN 


018836 


bos taurus 


45 


U 


]6 


27, 


,2 


355 


1 


DVRl^BRARE 


P35621 


brachydanio 



ALIGNMENTS 



RESULT 1 
TGF1_B0VIN 

ID TGF1_B0VIN STANDARD; PRT; 315 AA. 



AC P18341; 

DT 01-NOV-1990 (Rel. 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa ; Chordata ; Craniata; Vertebrata; Euteleostomi ; 

0C Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91042552; PubMed=3 1534 59 ; 

RA van Obberghen-Schill ing E., Kondaiah P., Ludwig R.L. , Sporn M.B., 

RA Baker C.C. ; 

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1 . " ; 

RL Mol. Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 

RC TISSUE=Bone; 

RX MEDLINE=9212 93 07; PubMed=1733 93 6 ; 

RA Ogawa Y. , Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforming growth factor-beta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone."; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-RRTA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. Heterodimers of TGF-beta 1/2 

CC have been found in bone. 

CC -!- SUBCELLULAR LOCATION; Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 



CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; M36271; AAA30778.1; 

DR PIR; A40057; A40057. 

DR HSSP; P01137; 1KIA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro / I PRO 03 911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; I PRO 01111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF0068 8; TGFb_propept ide ; 1. 

DR PRINTS ; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


N0N_TER 


1 


1 






FT 


PROPEP 


<1 


203 






FT 


CHAIN 


204 


315 




TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


210 


219 




BY SIMILARITY. 


FT 


DISULFID 


218 


281 




BY SIMILARITY. 


FT 


DISULFID 


247 


312 




BY SIMILARITY. 


FT 


DISULFID 


251 


314 




BY SIMILARITY. 


FT 


DISULFID 


280 


280 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


7 


7 




N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


61 


61 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


101 


101 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


169 


171 




CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


315 AA; 


36269 MW; 


C2717A23D994E00E CRC64; 


Query Match 




91 , 


■ 1%; 


Score 624; DB 1; Length 315; 


Best Local Similarity 


93 . 


,3%; 


Pred. No. 1.9e-59; 



Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IMIIIIIIII IIIIIIIMIIIIIIIIIIIIIIIIIIIIIMMIIIMII 

Db 204 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 255 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

MINIMI MINIM MINI Ml MM MINI I MINIMI MM Ml Mill II 

Db 256 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 2 
TGF1_CANFA 

ID TGF1_CANFA STANDARD; PRT ; 3 90 AA. 

AC P54831; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 



OS 

oc 
oc 
ox 

RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



Canis familiaris (Dog) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 
NCBI_TaxID=9615; 
[1] 

SEQUENCE FROM N.A. 
TISSUE=Jugular vein endothelial; 
MEDLINE=9523763 0; PubMed=772 1110 ; 

Manning A.M., Auchampach J. A., Drong R.F., Slightom J.L.; 

"Cloning of a canine cDNA homologous to the human transforming growth 

factor-beta 1 -encoding gene."; 

Gene 155 :307-308 (1995) . 

-!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 
TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 
HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 
ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 
A POSITIVE OR NEGATIVE DIRECTION OF THKTR FFFKPTQ 
SUBUNIT: Homodimer; disul fide- 1 inked . 
SUBCELLULAR LOCATION: Secreted. 
SIMILARITY: Belongs to the TGF-beta family. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch). 



1; -. 



EMBL; L34956; AAA51458 
PIR; JC4023; JC4023. 
HSSP; P01137; 1KLA . 
InterPro; IPR002400; GF_cysknot . 
InterPro; IPR003911; TGF_TGFb . 
InterPro; IPR001839; TGFb . 
InterPro; IPR001111; TGFb_N . 
Pfam; PF00019; TGF-beta; 1. 
Pfam; PF00688; TGFb_propept ide ; 1. 
PRINTS; PR00438; GFCYSKNOT. 
PRINTS; PRO 142 3; TGFB ETA . 
ProDom; PD0003 57; TGFb; 1. 
SMART; SM00204; TGFB; 1. 
PROSITE; PS00250; TGF__BETA_1; 1. 
Growth factor; Mitogen; Glycoprotein; Signal 



FT 


SIGNAL 


1 


23 


POTENTIAL. 




FT 


PROPEP 


24 


278 


BY SIMILARITY. 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR RRTA 1 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN. 




FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 



SQ SEQUENCE 390 AA; 44185 MW; EB478 0E8 8B7B590E CRC64 ; 



Query Match 91.1%; Score 624; DB 1; Length 390; 

Best Local Similarity 93.3%; Pred. No. 2.4e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

lllllllllll IIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 279 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

i 1 1 1 1 1 1 1 11 1 m j 1 1 1 1 1 1 1 1 ii 1 1 ii ',i 1 1 'i i r 1 1 1 1 1 n 1 1 1 1 1 1 ' m i 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 3 
TGF1_CERAE 

ID TGF1 CERAE STANDARD ; PRT; 39 0 AA. 

AC P09533; 

DT 01-MAR-1989 (Rel. 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae,- 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=9534 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87246074; PubMed=34 74 13 0 ; 

RA Sharpies K. , Plowman G.D., Rose T.M., Twardzik D.R., Purchio A.F.; 

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA . " ; 

RL DNA 6:239-244 (1987) . 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense®isb-sib. ch) . 

CC 

DR EMBL; M16658; AAA35369.1; 

DR PIR; A26960; A26960. 

DR HSSP; P01137; 1KLA . 



DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 



DR 


PRINTS ; 


PR00438; 


GFCYSKNOT . 




DR 


PRINTS; 


PR01423; 


TGFBETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM0 02 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA_1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


16 


POTENTIAL . 


FT 


PROPEP 


17 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1 . 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 44356 MW; 


DFF63E2BAB44320E CRC64; 



Query Match 91.1%; Score 624; DB 1; Length 3 90; 

Best Local Similarity 93.3%; Pred. No. 2.4e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCP YIW 60 

Illllllllll IIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 279 ALDTNYCFSST EKNCCWQLY I DFRKDLGWKW I HEPKGYHANFCLGPCP YIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

M M 1 1 1 1 1 1 1 1 . 1 1 1 M 1 ,1 1 II 1 1 II i M 1 1 1 1 1 1 Ml m: M 1 1 1 1 :i 1 1 1 1 ; II 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 4 
TGF INHUMAN 

ID TGF1_HUMAN STANDARD; PRT; 3 90 AA. 

AC P01137; Q9UCG4; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 OR TGFB. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87174 84 5; PubMed=3 470709 ; 

RA Derynck R., Rhee L . , Chen E.Y., van Tilburg A. ; 

RT "Intron-exon structure of the human transforming growth factor-beta 



RT precursor gene." ; 

RL Nucleic Acids Res. 15:3188-3189(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-85296301; PubMed=38 6194 0 ; 

RA Derynck R., Jarrett J. A., Chen E.Y., Eaton D.H., Bell J.R., 

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V. ; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Duodenum, and Eye; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H. , Derge J.G. , 

RA Klausner R.D., Collins F.S., Wagner L . , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B. # Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T. , Max S.I., Wang J.. Hsieh F.. 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M., Soares M.B., Bonaldo M.F. # Casavant T.L. , Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carnmci P . , Prange C. , 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J. , McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S. , Garcia A.M., Gay L. J. , Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M . , Madan A., Rodrigues S. , Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M . A . ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N.A. 

RC TISSUE=Carcinoma ; 

RA Urushizaki Y., Niitsu Y., Terui T. , Koshida Y . , Mahara K. , Kohgo Y w 

RA Urushizaki I., Takahashi Y., Ito H. ; 

RT "Cloning and expression of the gene for human transforming growth 

RT factor-beta in Escherichia coli."; 

RL Tumor Res. 22:41-55(1987). 

RN [5] 

RP SEQUENCE OF 279-329. 

RC TISSUE=Bladder carcinoma; 

RX MEDLINE=93229900; PubMed=8471846 ; 

RA Bourdrel L., Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J., 

RA Hu S., Westcott K.R.; 

RT "Recombinant human transforming growth factor-beta 1: expression by 

RT Chinese hamster ovary cells, isolation, and characterization."; 

RL Protein Expr. Purif. 4:130-140(1993). 

RN [6] 

RP SEQUENCE OF 279-301. 

RX MEDLINE=8 5131019; PubMed=2 98282 9 ; 

RA Massague J., Like B.; 

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 



RL J. Biol. Chem. 260:2636-2645(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144319; PubMed=8424 94 2 ; 

RA Archer S.J., Bax A . , Roberts A.B., Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 

RL Biochemistry 32:1152-1163(1993). 

RN [8] 

RP STRUCTURE BY NMR OF 279-3 90. 

RX MEDLINE=93144320; PubMed=8424 943 ; 

RA Archer S.J., Bax A., Roberts A.B., Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J . , 

RA Torchia D.A. ; 

RT " Trans f orminq growth factor beta 1: secondary st- rnr-t-nv^ *g determined 

RT by heteronuclear magnetic resonance spectroscopy ." ; 

RL Biochemistry 32:1164-1171(1993). 

RN [9] 

RP STRUCTURE BY NMR OF 279-3 90. 

RX MEDLINE=96266150; PubMed=8679613 ; 

RA Hinck A.P., Archer S. J. , Qian S.W., Roberts A.B., Sporn M.B., 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2 . " ; 

RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X05839; CAA29283.1; 

DR EMBL; X05840; CAA29283.1; JOINED. 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT__SEQ . 

DR EMBL; BC001180; AAH01180.1; -. 

DR EMBL; BC000125; AAH00125.1; -. 



DR EMBL; BC022242; AAH22242.1; 

DR EMBL; M38449; AAA36735.1; 

DR PIR; A27513; WFHU2 . 

DR PDB; 1KLA; 17-AUG-96. 

DR PDB; 1KLC; 17-AUG-96. 

DR PDB; 1KLD; 17-AUG-96. 

DR Genew; HGNC : 11766; TGFB1 . 

DR MIM; 190180; 

DR GO; GO: 0006916; P : ant i -apoptosis ; TAS . 

DR InterPro ; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS 0 02 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 




FT 


DISULFID 


293 


356 




FT 


DISULFID 


322 


387 




FT 


DISULFID 


326 


389 




FT 


DISULFID 


355 


355 


INTERCHAIN. 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


10 


10 


L -> P (IN REF. 2) . 


FT 


CONFLICT 


159 


159 


R -> RR (IN REF . 2) . 


FT 


STRAND 


281 


281 




FT 


TURN 


282 


287 




FT 


STRAND 


294 


296 




FT 


STRAND 


300 


300 




FT 


TURN 


302 


305 




FT 


STRAND 


313 


313 




FT 


STRAND 


317 


317 




FT 


STRAND 


321 


323 




FT 


HELIX 


335 


346 




FT 


TURN 


348 


349 




FT 


STRAND 


358 


370 




FT 


TURN 


371 


372 




FT 


STRAND 


373 


387 




SQ 


SEQUENCE 


390 AA; 


44341 


MW; 75391614250288FE CRC64 ; 



Query Match 91.1%; Score 624; DB 1; Length 390; 

Best Local Similarity 93.3%; Pred. No. 2.4e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

1 1 M 1 1 1 II 1 1 1 1 1 II I II I II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIW 33 0 



Qy 61 SLDTQYSKVTALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMIVRSCKCS 12 0 

1 1 , 1 1 ii ! 1 1 1 1 1 1 1 in 1 1 1 1 1 1 ii m ii m 1 1 1 1 1 ii 1 1 ' i ii i r ' 1 1 1 ;i 1 1 1 ' i , 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 5 
TGF1_PIG 

ID TGF1__PIG STANDARD; PRT ; 390 AA . 

AC P07200; P08832; 

DT 01-APR-1988 (Rel . 07, Created) 

DT 01-APR-1988 (Rel. 07, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactvla ; Suina ; Suidae; Sus 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=87174844; PubMed=3470708 ; 

RA Derynck R. , Rhee L. ; 

RT "Sequence of the porcine transforming growth factor-beta precursor."; 

RL Nucleic Acids Res. 15:3187-3187(1987). 

RN [2] 

RP SEQUENCE FROM N.A. , AND VARIANT VAL-114. 

RC STRAIN=Mmiature swine; 

RX MEDLINE=89054010; PubMed=24 613 67 ; 

RA Kondaiah P., van Obberghen-Schil 1 ing E. , Ludwig R.L., Dhar R., 

RA Sporn M.B., Roberts A.B.; 

RT " cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

RT Evidence for alternate splicing and polyadenylation . " ; 

RL J. Biol. Chem. 263:18313-18317(1988). 

RN [3] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RX MEDLINE=88335639; PubMed=3 16652 0 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Nucleotide sequence of chicken transforming growth factor-beta 1 

RT (TGF-beta 1) 

RL Nucleic Acids Res. 16:8730-8730(1988). 

RN [4] 

RP SHOWS THAT REF . 3 SEQUENCE IS FROM PIG. 

RA Jakowlew S . B . ; 

RL Unpublished observations (MAR-1996) . 

RN [5] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RA Wimmers K. , Chomdej S., Ponsuksili S., Schellander K. ; 

RT "Polymorphism in the porcine transforming growth factor beta 1 

RT gene . " / 

RL Submitted (DEC-2001) to the EMBL/ GenBank/DDB J databases. 

RN [6] 

RP SEQUENCE OF 279-322. 

RX MEDLINE=87102 8 90; PubMed=28 7 963 5 ; 

RA Cheifetz S., Weatherbee J. A. , Tsang M.L.S., Anderson J.K., Mole J.E., 

RA Lucas R. , Massague J . ; 



RT "The transforming growth factor-beta system, a complex pattern of 

RT cross -reactive ligands and receptors."; 

RL Cell 48:409-415(1987). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF.3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF.4) TO ORIGINATE FROM PIG. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There arp no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; Y00111; CAA68291.1; 

DR EMBL; M23703; AAA64616.1, 

DR EMBL; X12373 ; CAA30933.1; 

DR EMBL; AF461808; AAL57902.1; 

DR PIR; A27512; A27512. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; : 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 



DR 


ProDom; 


PD000357; 


TGFb; 


1 . 






DR 


SMART; SM00204; TGFB; 1 








DR 


PROSITE; 


PS00250; 


TGF BETA 


1; 1 . 




KW 


Growth factor; Mitogen; 


Glycoprotein; Signal ; Polymorphism. 


FT 


SIGNAL 


1 


23 




POTENTIAL. 




FT 


PROPEP 


24 


278 








FT 


CHAIN 


279 


390 




TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 




BY SIMILARITY. 




FT 


DISULFID 


293 


356 




BY SIMILARITY. 




FT 


DISULFID 


322 


387 




BY SIMILARITY. 




FT 


DISULFID 


326 


389 




BY SIMILARITY. 




FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 




N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE 


(POTENTIAL) . 


FT 


VARIANT 


114 


114 




L -> V. 




FT 


CONFLICT 


6 


7 




LR -> PG (IN REF. 3) 




FT 


CONFLICT 


180 


180 




R -> G (IN REF. 3) . 




FT 


CONFLICT 


237 


237 




N -> NA (IN REF . 3) . 





SQ SEQUENCE 390 AA; 44294 MW; A6E2C365 9FC3 84E6 CRC64 ; 



Query Match 91.1%; Score 624; DB 1; Length 390; 

Best Local Similarity 93.3%; Pred . No. 2.4e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiimi i in i '!i i ii .m 1 1 1 1 ii i ii, 1 1 ii 1 1; 1 1 1 1 1 , i 

Db 279 ALDTNYCFSST EKNCCWQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP1VYYVGRKPKVEQLSNMIVRSCKCS 12 0 

IIIMIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIMIIMIIIIIMIIIIIIIIII 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 6 
TGF1_SHEEP 

ID TGF1_SHEEP STANDARD; PRT; 3 90 AA . 

AC P50414; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Ovis aries (Sheep) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=9 94 0; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=95121932; PubMed=7821809 ; 

RA Woodall C.J., McLaren L.J. ( Watt N.J.; 

RT "Sequence and chromosomal localisation of the gene encoding ovine 

RT latent transforming growth factor-beta 1 . " ; 

RL Gene 150:371-373(1994). 

RN [2] 

RP SEQUENCE OF 281-390 FROM N.A. 

RC STRAIN=Merino; TISSUE=Skin ; 

RX MEDLINE=95268698 ; PubMed=7749621 ; 

RA Sutton R., Ward W.G. , Raphael K.A. , Cam G.R.; 

RT "Growth factor expression in skin during wool follicle development."; 

RL Comp. Biochem. Physiol. HOB: 697-705 (1995) . 

CC FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL ; X76916; CAA54242.1; 

DR EMBL; L36038; AAA31526.1; 

DR PIR; 146463; 146463. 

DR HSSP; P01137; 1KLA. 

DR InterPro; I PR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; I PRO 01 111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PRO 142 3; TGFBETA . 

DR ProDom; PD0 0 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL . 

FT PROPEP 24 2 78 POTENTIAL. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 82 82 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 136 136 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 176 176 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 390 AA; 44291 MW; 1C247299484D0E57 CRC64; 

Query Match 91.1%; Score 624; DB 1; Length 390; 

Best Local Similarity 93.3%; Pred. No. 2.4e-59; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

1 1 1 1 1 1 1 II 1 1 MIMIMIMMIIMIIMIIIIIIIMIMIIIMIII 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 7 
TGFl_MOUSE 

ID TGFl_MOUSE STANDARD; PRT; 3 90 AA . 

AC P04202; 

DT 20-MAR-1987 (Rel . 04, Created) 

DT 20-MAR-1987 (Rel. 04 , Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Mus musculus (Mouse) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86168129; PubMed=3007454 ; 

RA Derynck R., Jarrett J. A., Chen E.Y., Goeddel D.V. ; 

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RX MEDLINE=96096545; PubMed=8522200 ; 

RA Guron C, Sudarshan C, Raghow R. ; 

RT "Molecular organization of the gene encoding murine transforming 

RT growth factor beta 1 . " ; 

RL Gene 165:32 5-326(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6, and NOD/LT; TISSUE=Spleen ; 

RA Poirot L . , Benoist C. , Mathis D. ; 

RT "Transforming growth factor-beta 1 sequence and expression-, no 

RT difference between NOD/Lt and C57B1/6 mouse strains."; 

RL Submitted (AUG-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-s ib . ch) . 

CC 

DR EMBL; M13177; AAA40423.1; -. 

DR EMBL; L42462; AAB00138.1; 

DR EMBL; L42456; AAB00138.1; JOINED. 

DR EMBL; L42457; AAB00138.1; JOINED. 

DR EMBL; L42458; AAB00138.1; JOINED. 

DR EMBL; L42459; AAB00138.1; JOINED. 

DR EMBL; L42460; AAB00138.1; JOINED. 

DR EMBL; L42461; AAB00138.1; JOINED. 

DR EMBL; AJ009862; CAA08 900.1; -. 

DR PIR; A01396; WFMS2 . 

DR HSSP; P01137; 1KLA . 

DR MGD; MGI: 98 725; Tgfbl . 

DR GO; GO:0005578; C : extracel lular ma tr ix ; IDA. 

DR GO; GO: 0006954; P : inf lamma tory response; IMP. 

DR GO; GO: 0007515; P : lymph gland development; IMP. 



DR GO; GO: 0008220; P:necrosiS; IMP . 

DR GO; GO-.0016202; P : regulation of myogenesis; IDA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGFJTGFb . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PRO 1423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 27 8 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 82 82 N-LINKED (GLCNAC . . .) (POTENTIAL). 

FT CARBOHYD 136 136 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 176 176 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 390 AA; 44310 MW; 438 1A51B711D68 9E CRC64 ; 

Query Match 90.7%; Score 621; DB 1; Length 390; 

Best Local Similarity 92.5%; Pred . No. 5e-59; 

Matches 111; Conservative 1; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

iiiiiiiiiii 1 1 1 1 1 1 j 1 1 1 1 1 1 r f 1 1 1 1 1 1 1 1 r i r 1 1 1 1 r r i f r 1 1 1 1 1 1 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

MIIMIIIIIIIMIIMIIMIMIIIIIIIIIIIIIIIIIIIMIIIMIIIMM 

Db 331 SLDTQYSKVLALYNQHNPGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 8 
TGF1_RAT 

ID TGFl^RAT STANDARD; PRT; 3 90 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 
OX NCBI_TaxID=101l6; 
RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=Sprague-Dawley; TISSUE=Heart ; 



RX MEDLINE=90272425; PubMed=234 9 108 ; 

RA Qian S.W., Kondaiah P . , Roberts A.B., Sporn M.B.; 

RT "cDNA cloning by PCR of rat transforming growth factor beta-1." ; 

RL Nucleic Acids Res. 18:3 059-3 059(1990). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as -its r-ont-pnt- |«? in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; X52498; CAA36741.1; -. 

DR PIR; S10219; S10219 . 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44329 


MW; 5E21108ED50D853C CRC64 ; 



Query Match 90.7%; Score 621; DB 1; Length 390; 

Best Local Similarity 92.5%; Pred. No. 5e-59; 

Matches 111; Conservative 1; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

Mlllililll llllllllllllilliiiillliillliiiiilliiiiill 



Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIVJ 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

MIIMMMIMIIMMMhlMMIIMMIIIMMMMMIMIMMMMI 

Db 331 SLDTQYSKVLALYNQHNPGASAS PCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 

RESULT 9 
TGF1_H0RSE 

ID TGF1_H0RSE STANDARD; PRT; 390 AA . 

AC 019011; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15- JUL- 1998 (Rel. 36 , Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata- Ent-pl pn^nmi ; 

OC Mammalia; Eutheria ; Perissodactyla ; Equidae; Equus. 

OX NCBIJTaxID=9796; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Lymph node; 

RX MEDLINE=98 185507; PubMed-95248 19 ; 

RA Penha-Goncalves M.N. , Onions D.E., Nicolson L.; 

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA . " ; 

RL DNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X99438; CAA67801.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; I PRO 03911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PRO 14 23; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 



DR 


SMART; SM00204; TGFB; 1 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 1. 


KW 


Growth factor; Mitogen; 


Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL . 


FT 


PROPEP 


24 


278 


BY SIMILARITY . 


FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


SQ 


SEQUENCE 


390 AA; 


43974 MW; A86D715F44549691 CRC64 ; 



Query Match 90.5%; Score 62 0; DB 1; Length 3 90; 

Best Local Similarity 92.5%; Pred . No. 6.3e-59; 

Matches 111; Conservative 0; Mismatches 1; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I II I I I I I I I I II I I I 

Db 279 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

MIMIIIMIIIIIIMIIMIIIIIM llllllllllllllllllllllllllllll 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQVLEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 10 
TGFl_CAVPO 

ID TGFl_CAVPO STANDARD; PRT; 3 90 AA. 

AC Q9Z1Y6; Q9QZB3 ; Q9R14 8; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; 

RA Jeevan A., McMurray D.N., Yoshimura T.; 

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination."; 

KL Submitted (OCT-1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 265-382 FROM N.A. 

RX MEDLINE=99144670; PubMed=l 002 5978 ; 

RA Scarozza A.M., Ramsingh A.I., Wicher V., Wiener K. ; 

RT "Spontaneous cytokine gene expression in normal guinea pig blood and 

RT tissues . " ; 

RL Cytokine 10:851-859(1998). 

RN [3] 



RP SEQUENCE OF 279-371 FROM N.A. 

RC STRAIN=Hartley; TI SSUE=Trachea ; 

RA Morishima Y., Uchida Y. , Nomura A., Ishii Y . , Sakamoto T. , 

RA Sekizawa K. ; 

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium."; 

RL Submitted (JUL-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 

CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -!- SUBUNIT: Homodimer; disul f ide-1 inked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted- 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright . It is produced through a col 1 abor^f h nn 

CC between the Swiss Institute of Biomf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

















DR 


EMBL; AF191297; 


AAF02780 


- 1; - 






DR 


EMBL; AF097509; 


AAC83807 


• 1; - 






DR 


EMBL; AF16 934 7; 


AAD49347 


.1; - 






DR 


HSSP; P01137; 1KLA. 








DR 


InterPro; 


IPR002400; GF_ 


cysknot . 




DR 


InterPro; 


IPR003911; TGF 


TGFb. 




DR 


InterPro ; 


IPR001839; TGFb. 






DR 


InterPro; 


IPR001111; TGFb N. 






DR 


Pfam; PF00019; TGF-beta; 


1 . 






DR 


Pfam; PF00688; TGFb_propeptide ; 1. 




DR 


PRINTS; PR00438; 


GFCYSKNOT. 






DR 


PRINTS; PR01423; 


TGFB ETA 








DR 


ProDom; PD0003 57; TGFb; 


1 . 






DR 


SMART; SM00204; 


TGFB ; 1 . 








DR 


PROSITE; 


PS002 50; TGF BETA 1; 


1 . 




KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 




FT 


SIGNAL 


1 


24 




POTENTIAL. 




FT 


PROPEP 


25 


278 




POTENTIAL. 




FT 


CHAIN 


279 


390 




TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 




BY SIMILARITY. 




FT 


DISULFID 


293 


356 




BY SIMILARITY. 




FT 


DISULFID 


322 


387 




BY SIMILARITY. 




FT 


DISULFID 


326 


389 




BY SIMILARITY. 




FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 




N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE 


(POTENTIAL) . 


FT 


CONFLICT 


279 


279 




G - > P (IN REF. 3) . 




FT 


CONFLICT 


286 


286 




F - > S (IN REF. 2) . 




FT 


CONFLICT 


309 


309 




K -> E (IN REF. 2) . 




FT 


CONFLICT 


322 


322 




C -> R (IN REF. 2) . 





FT CONFLICT 350 350 A -> G (IN REF . 2) . 

SQ SEQUENCE 390 AA; 44328 MW; 1 53 9F84 9BA0C0FF1 CRC64 ; 



Query Match 89.3%; Score 612; DB l; Length 390; 

Best Local Similarity 92.4%; Pred . No. 4.5e-58; 

Matches 110; Conservative 0; Mismatches 1; Indels 8; Gaps 1; 
Qy 2 LDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWS 61 

llllllllll M 1 1 1 M I , . 1 1 1 ! M I I II 1 1 i 1 1 1 1 i i 1 1 1 1 . ! 1 1 1 1 1 

Db 280 LDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWS 331 

Qy 62 LDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

M I II 11 M M I M 11 1 M I M II M M M I M II 1 1 M I M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

Db 332 LDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKAKVEQLSNMI VRSCKCS 3 90 



RESULT 11 
TGF1 CHICK 



ID TGF1__CHICK STANDARD ; PRT; 373 AA. 

AC P09531; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4) 

DE (Fragment) . 

GN TGFB1. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=24 64 13 1 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol . Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=92357039; PubMed= 1 3 53 8 6 0 ; 

RA Burt D.W., Jakowlew S.B.; 

RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA . " ; 

RL Mol. Endocrinol. 6:989-992(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license^isb-sib. ch) . 

CC 

DR EMBL; M31160; AAB05637.1; 

DR PIR; A41918; A41918. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS ; PR01423; TGFB ETA . 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


NON_TER 


1 


1 




FT 


SIGNAL 


<1 


1 


POTENTIAL. 


FT 


PROPEP 


2 


259 


POTENTIAL . 


FT 


CHAIN 


260 


373 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


266 


277 


BY SIMILARITY. 


FT 


DISULFID 


276 


339 


BY SIMILARITY. 


FT 


DISULFID 


305 


370 


BY SIMILARITY. 


FT 


DISULFID 


309 


372 


BY SIMILARITY. 


FT 


DISULFID 


338 


338 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


54 


54 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


109 


109 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


153 


153 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


224 


226 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA; 


42634 


MW; 9903F3479C8552E5 CRC64 ; 



Query Match 78.8%; Score 54 0; DB 1; Length 3 73; 

Best Local Similarity 79.2%; Pred. No. 2.2e-50; 

Matches 95; Conservative 8; Mismatches 9; Indels 8; Gaps 2; 
Qy 2 LDTNYCFS-STDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 6 0 

MMIi II Illllll lllllllll llllllllll lllhlllllll 

Db 261 LDTDYCFG PGTD EKNCCVRPLYI DFRKDLQWKWI HEPKGYMANFCMGPCPYIW 313 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I ' ' , M I I I .' I I ! I II ' I II I II I.' I I H I I I : I I I I I :||||Mhlhllll 
Db 314 SADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPI I YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
TGF1_XENLA 

ID TGF1_XENLA STANDARD; PRT; 382 AA . 

AC P16176; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5) . 



OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=901100 90; PubMed=22 9560 1 ; 

RA Kondaiah P . , Sands M.J., Smith J.M., Fields A., Roberts A.B., 

RA Sporn M.B., Melton D.A. ; 

RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis."; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.; 

RL Submitted (AUG- 1997) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DT FFERENTT ATTOM 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY : Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense®isb-s ib . ch) . 

CC 

DR EMBL; J05180; AAA49968.1; 

DR EMBL; AF009335; AAB64441.1; -. 

DR EMBL; AF009331; AAB64441.1; JOINED. 

DR EMBL; AF009332; AAB64441.1; JOINED. 

DR EMBL; AF009333; AAB64441.1; JOINED. 

DR EMBL; AF009334; AAB64441.1; JOINED. 

DR PIR; A34929; B61036. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; I PRO 03 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438 ; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 21 POTENTIAL. 

FT PROPEP 22 270 

FT CHAIN 271 382 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 277 286 BY SIMILARITY. 

FT DISULFID 285 348 BY SIMILARITY. 

FT DISULFID 314 379 BY SIMILARITY. 

FT DISULFID 318 381 BY SIMILARITY. 



FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


123 


123 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


166 


166 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


234 


236 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


382 AA; 


44200 MW; 


1034621C917AAE15 CRC64 ; 


Query Match 




71 . 7%; 


Score 4 91; DB 1; Length 382; 



Best Local Similarity 73.0%; Pred. No. 3.9e-45; 

Matches 84; Conservative 10; Mismatches 13; Indels 8; Gaps 1; 

Qy 6 YCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQ 65 

Ml = Illh M I : ! I 1 M I I I II M M I I Ihlll lllllhlll 

Db 276 YCFGNNG PNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQ 327 

Qy 66 YS KVLALYNQHNPGASAAPCCVPQALE PL P I VY Y VGRKPKVEQLSNM I VRS CKCS 120 

Illlhlllhlllll h In lllllhlllll I II I I I I !: M II II 
Db 328 YSKVLSLYNQNNPGAS I SPCCVPDVLEPLPI I YYVGRTAKVEQLSNMVVRSCNCS 



RESULT 13 
TGF3_MOUSE 

ID TGF3_MOUSE STANDARD; PRT; 410 AA. 

AC P17125; 

DT 01-AUG-1990 (Rel. 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90190650; PubMed=262873 0 ; 

RA Miller D.A. , Lee A., Matsui Y., Chen E.Y., Moses H.L. , Derynck R . ; 

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues."; 

RL Mol . Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714; PubMed=22 0655 6 ; 

RA Denhez F. , Lafyatis R., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta, 

RT mTGF-beta 3 . " ; 

RL Growth Factors 3:139-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -\- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Biomf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license^isb-sib. ch) . 

CC 

DR EMBL; M32745; AAA4 0422.1; -. 

DR PIR; A41397; A41397. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI: 98 727; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; I PRO 03911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 



DR 


PRINTS; 


PR00438 ; 


GFCYSKNOT . 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 


1. 




DR 


SMART ; SM00204; TGFB; 1 






DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 


FT 


SIGNAL 


1 


23 




POTENTIAL. 


FT 


PROPEP 


24 


298 




POTENTIAL . 


FT 


CHAIN 


299 


410 




TRANSFORMING GROWTH FACTOR BETA 3 . 


FT 


DISULFID 


305 


314 




BY SIMILARITY. 


FT 


DISULFID 


313 


376 




BY SIMILARITY. 


FT 


DISULFID 


342 


407 




BY SIMILARITY. 


FT 


DISULFID 


346 


409 




BY SIMILARITY. 


FT 


DISULFID 


375 


375 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


72 


72 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


133 


133 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


140 


140 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


259 


261 




CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


410 AA; 46884 MW ; 


250F7048CA432BD6 CRC64 ; 


Query Match 




70. 


.9%; 


Score 4 86; DB 1; Length 410; 



Best Local Similarity 71.7%; Pred. No. 1.5e-44; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIM = hlllll 1 1 II I h 1 1 1 1 1 h 1 1 1 1 1 h 1 1 1 1 MMh 

Db 299 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 350 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

I 1 1 : 1 1 1 III 1 1 i : I ■ 1 1 1 1 1 1 I 1 1 1 hlllll 1 1 M 1 1 II h h M I M 

Db 351 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



RESULT 14 
TGF3_HUMAN 

ID TGF3_HUMAN STANDARD; PRT; 412 AA . 

AC P10600; 

DT 01-JUL-1989 (Rel . 11, Created) 

DT 01-JUL-1989 (Rel. 11, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 



OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88263019; PubMed=3 164476 ; 

RA ten Dijke P . , Hansen P . , Iwata K. , Pieler C. , Foulkes J.G.; 

RT "Identification of another member of the transforming growth factor 

RT type beta gene family."; 

RL Proc. Natl. Acad. Sci. U.S.A. 85:4715-4719(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta ; 

RX MEDLINE=8 90 9112 0 ; PubMed=32 0874 6 ; 

RA Derynck R., Lindquist P.B., Lee A., Wen D. , Tamm J . , Graycar J.L., 

RA Rhee L. , Mason A.J., Miller D.A., Coffey R.J,, Moses H.L., Chen E.Y.; 

RT "A new type of transforming growth factor-beta .. T^F-hpt-* " ; 

RL EMBO J. 7:3737-3743(1988). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Madan A., Rowen L . , Qin S., Dickhoff R., Shaffer T. , James R., 

RA Abbasi N., Loretz C. , Madan A., Dors M., Dahl T. , Hall J. , Lasky S., 

RA Hood L . ; 

RT "Complete genomic sequence of human transforming growth factor-beta 

RT 3 . " ; 

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Rieder M.J., Livingston R.J., Braun A.C., Montoya M.A., Chung M.-W., 

RA Miyamoto K.E., Nguyen CP., Nguyen D.A., Poel C.L., Robertson P.D., 

RA Schackwitz W.S., Sherwood J.K., Witrak L.A. , Nickerson D.A. ; 

RL Submitted (AUG-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS) OF 301-412. 

RX MEDLINE=96416253; PubMed=88 1 9 1 59 ; 

RA Mittl P.R., Priestle J. P., Cox D.A. , McMaster G. # Cerletti N . , 

RA Grutter M.G. ; 

RT "The crystal structure of TGF-beta 3 and comparison to TGF-beta 2: 

RT implications for receptor binding."; 

RL Protein Sci. 5:1261-1271(1996). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENES I S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

cc 

DR EMBL; J03241; AAA61161.1; -. 

DR EMBL; X1414 9; CAA32362.1; -. 

DR EMBL; X14885; CAA33 024.1; ALT INIT. 



DR 


EMBL; X14886; CAA33024.1; 


JOINED . 


DR 


EMBL; X148 


87; CAA33024.1; 


JOINED. 


DR 


EMBL; X148 


88; CAA33024.1; 


JOINED. 


DR 


EMBL; X14889; CAA33024.1; 


JOINED. 


DR 


EMBL; X14890; CAA33024.1; 


JOINED. 


DR 


EMBL; X148 


91; CAA33024.1; 


JOINED. 


DR 


EMBL; AF107885; AAC79727. 


1; -. 


DR 


EMBL; AY140241; AAM96819. 


1; -. 


DR 


PIR; A36169; A36169. 




DR 


PDB; 1TGJ; 


ll-JAN- 


97. 




DR 


PDB; 1TGK; 


12-MAR- 


97. 




DR 


PDB; 1KTZ; 


27-FEB- 


02 . 




DR 


Genew; HGNC: 1176 9; 


TGFB 3 . 




DR 


MIM; 190230; 






DR 


GO; GO:0005160; F: 


transforming growth factor-beta receptor li. 


DR 


GO; GO: 0007267; P: 


cell-cell signaling; TAS . 


DR 


GO; GO:0007397; P: 


histogenesis and organogenesis; TAS. 


DR 


GO; GO:0007165; P: 


signal 


transduction; TAS . 


DR 


InterPro; 


IPR002400; GF_cysknot . 


DR 


InterPro; 


IPR003911; TGF_ 


TGFb . 


DR 


InterPro ; 


IPR001839; TGFb. 


DR 


InterPro; 


IPR001111; TGFb 


_N. 


DR 


Pfam; PF00019; TGF-beta; 


1 . 


DR 


Pfam; PF00688; TGFb propeptide; 1. 


DR 


PRINTS; PR00438; GFCYSKNOT . 


DR 


PRINTS; PRO 1423; TGFBETA. 




DR 


ProDom; PD0 0 03 57; 


TGFb; 1 




DR 


SMART; SM002 04; TGFB; 1. 




DR 


PROSITE; PS00250; 


TGF BETA 1; 1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 


FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


PROPEP 


21 


300 




FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


307 


316 




FT 


DISULFID 


315 


378 




FT 


DISULFID 


344 


409 




FT 


DISULFID 


348 


411 




FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


STRAND 


303 


303 




FT 


HELIX 


304 


309 




FT 


STRAND 


316 


318 




FT 


STRAND 


321 


323 




FT 


HELIX 


324 


328 




FT 


TURN 


331 


332 




FT 


STRAND 


333 


335 




FT 


STRAND 


338 


340 




FT 


STRAND 


343 


345 




FT 


TURN 


350 


353 




FT 


STRAND 


354 


354 




FT 


HELIX 


357 


368 




FT 


TURN 


370 


371 




FT 


STRAND 


378 


380 




FT 


STRAND 


383 


392 





FT TURN 3 93 3 94 

FT STRAND 3 95 4 06 

FT STRAND 4 08 412 

SQ SEQUENCE 412 AA; 47328 MW; 3CAD354 8D3AEA1 78 CRC64 ; 

Query Match 70.9%; Score 486; DB 1; Length 412; 

Best Local Similarity 71.7%; Pred. No. 1.5e-44; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIW 60 

I Ml MM = h Mill II I llh II I II hi II I I hllll MMh 

Db 3 01 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 3 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I II • I II III II Mhllllll MM hlllll 1 1 1 ! 1 1 - 1 1 : 1 ' 1 1 ' 1 1 

Db 353 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 

RESULT 15 
TGF3_RAT 

ID TGF3_RAT STANDARD ; PRT; 412 AA. 

AC Q07258; 

DT 01-OCT-1994 (Rel . 30, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 OR TGF-B3 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus . 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; TISSUE=Lung; 

RX MEDLINE=95155340; PubMed=78 52342 ; 

RA Wang J., Kuliszewski M. , Yee W., Sedlackova L. , Xu J., Tseu I., 

RA Post M. ; 

RT "Cloning and expression of glucocorticoid- induced genes in fetal rat 

RT lung fibroblasts. Transforming growth factor-beta 3 . " ; 

RL J. Biol. Chem. 270:2722-2728(1995). 

RN [2] 

RP SEQUENCE OF 159-213 FROM N.A. 

RX MEDLINE=93286190; PubMed=85 09457 ; 

RA McKinnon R.D., Piras G., Ida J., Dubois-Dalq M. ; 

RT "A role for TGF-beta in oligodendrocyte differentiation."; 

RL J. Cell Biol. 121:1397-1407(1993). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESI S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to license$isb-sib . ch) . 

cc 

DR EMBL; U03491; AAA67915.1; -. 

DR EMBL; X71903; CAA50722 . 1 ; 

DR PIR; A55706; A55706. 

DR HSSP; P10600; 1TGJ. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; I PRO 03 9 11 ; TGFJTGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; I PRO 01111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD00 03 57; TGFb; 1. 

DR SMART; SMOG 2 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


300 


POTENTIAL. 


FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3 . 


FT 


DISULFID 


307 


316 


BY SIMILARITY. 


FT 


DISULFID 


315 


378 


BY SIMILARITY. 


FT 


DISULFID 


344 


409 


BY SIMILARITY. 


FT 


DISULFID 


348 


411 


BY SIMILARITY. 


FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


412 AA; 


47116 


MW; 24FD7D8 9 909 0AA9D CRC64 ; 



Query Match 70.8%; Score 485; DB 1; Length 412; 

Best Local Similarity 71.7%; Pred. No. 1.9e-44; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 ALDTNYCFSSTDYKDDDDKEKNCOmQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

Illlllll : hlllll lllllhlllllhlllllhllll |||||: 

Db 301 ALDTNYCFRNL EENCCVRPLYI DFRQDLGWKWVHEPKGYYANFCSGPCPYLR 352 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I II :| M Ml M llhllllll MM hlllll llllllllhhillll 
Db 3 53 SSDTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 



Search completed: October 28, 2 003, 09:08:40 
Job time : 3.4052 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 28, 2003, 07:50:55 ; Search time 9.6208 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 



US-10-017-372E-15 
685 

1 ALDTNYCFSSTDYKDDDDKE 



GRKPKVEQLSNMIVRSCKCS 12 0 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database SPTREMBL_2 3 : * 

1 : sp_archea : * 

2: sp_bacteria : * 

3 : sp_f ungi : * 

4 : sp_human : * 

5 : sp_invertebrate : * 

6 : sp_mammal : * 

7 : Sp_mhc : * 

8: sp_organelle: * 

9 : sp_phage : * 

10: spjplant:* 

11: sp_rodent : * 

12: sp_virus:* 

13 : sp_vertebrate: * 

14 : sp_unclassif ied : * 

15: sp_rvirus:* 

16 : spjoacteriap : * 

17: sp_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


624 


91 


1 


112 


6 


002730 


002730 oryctolagus 


2 


621 


90 


7 


368 


11 


Q8R4D9 


Q8r4d9 sigmodon hi 


3 


612 


89 


3 


390 


6 


Q9TUM8 


Q9tum8 equus cabal 


4 


609 


88 


9 


130 


11 


Q08714 


Q08714 mesocricetu 


5 


606 


88 


5 


124 


6 


Q95N80 


Q95n80 canis famil 


6 


540 


78 


8 


101 


11 


Q9R184 


Q9rl84 meriones un 


7 


486 


70 


9 


362 


11 


Q99K17 


Q99kl7 mus musculu 


8 


486 


70 


9 


412 


11 


Q91YU7 


Q91yu7 mus musculu 


9 


478 


69 


8 


382 


13 


093449 


093449 oncorhynchu 


10 


466 


68 


0 


255 


11 


Q921T1 


Q921tl mus musculu 



1 1 


466 


68 


. 0 


414 


11 


Q91VP5 


Q91vp5 mus musculu 


lz 


464.5 


67 


. 8 


200 


13 


Q90YF1 


Q90yfl pleuronecte 


J. J 


463.5 


67 


. 7 


379 


13 


P"\ n Trmr 

Q8 JHF5 


Q8jhf5 sparus aura 


1 A 


462.5 


67 


. 5 


382 


13 


Q9PWA9 


Q9pwa9 morone chry 


15 


459.5 


67 


. 1 


379 


13 


Q8AXK8 


Q8axk8 sparus aura 


J. b 


4 54 


66 


-> 

. 3 


224 


1 1 


Q8CDZ9 


Q8cdz9 mus musculu 


1 / 


436 


63 


. 6 


376 


13 


ps p t— \ m p\ 

Q9PTQ2 


Q9ptq2 cyprinus ca 


1 o 

1 o 


424 


6 1 


p 
. 9 


361 


13 


p\ r\ r> n r~ A 

Q98854 


Q98854 cyprinus ca 


iy 


420 


6 1 


. 3 


399 


1 1 


P^\ P n nni 

Q9ERB7 


Q9erb7 mesocricetu 


20 


413 


60 


. 3 


88 


13 


Q90YF5 


Q90yf5 pleuronecte 


z 1 


4 02 


58 


. 7 


n p 

88 


13 


Q90YF7 


Q9 0yf7 oncorhynchu 


o o 
zz 


~> p> — 7 

3 97 


58 


p, 
. 0 


88 


13 


P"\ P P f~7 T""< '"J 

Q90ZE7 


Q90ze7 acipenser b 


23 


393 


57 


. 4 


87 


13 


0423 06 


042306 carassius a 


O A 
Z 4 


TOT 

3 oz 


DO 


. 8 


9 1 


6 


Q9MYZ1 


Q9myzl capra hircu 


ZD 


373 


54 


. 5 


86 


6 


Q2824 1 


Q28241 cervus elap 


Z D 


JJO 


DZ 


. 3 


O 1 
O 1 


b 


yyNib3 


Q9nls3 capreolus c 


Z / 


C ") 

J 3 J 


C 1 


. b 


i7y 


13 


yy u Yr z 


Q90yf2 pleuronecte 


n o 
Z O 


317 


4 b , 


. 3 


77 


13 


O90YF8 




z y 


o r\ i 
3 U 1 


43 . 


. 9 


62 


13 


Q9 0ZJ7 


Q90zj7 anguilla an 


J U 


3 00 


43 


o 

. 8 


p- o 

62 


13 


Q90YF4 


Q90yf4 pleuronecte 


J 1 


TOO 

289 


42 


. 2 


50 


6 


Q2 824 0 


Q28240 cervus elap 


3z 


O Q *7 

Z o / 


4 1 , 


. y 


62 


13 


Q9DEP5 


Q9dep5 scophthalmu 


J J 


o o o 
Zo3 


4 1 . 


. 3 


62 


13 


/^\ A A O T n 

Q90ZJ8 


Q90zj8 anguilla an 


"3 A 
34 


Zo3 


A ~\ 

4 1 . 


-> 

. 3 


62 


13 


/~s pi p t r t— « p\ 

Q90YF9 


Q9 0yf9 oncorhynchu 


J b 


z 73 


3 9 . 


. 9 


p- <-> 

62 


13 


/™\ Pi p t r n o 

Q90YF3 


Q90yf 3 pleuronecte 


3 D 


one: c 
z U d . b 


3 U . 


. (J 


"3 O O 

3 o y 


13 


PN P P \J\7 Pt 

yy U Y YU 


Q90yy0 ictalurus p 


J / 


one: 

Z U D 


O Q 

z y . 


Q 

. y 


4z b 


4 


r\ QUO D A 


Q9hbp0 homo sapien 


38 


204 


29. 


, 8 


424 


13 


Q98SP9 


Q98sp9 meleagris g 


39 


203 . 5 


29. 


, 7 


385 


13 


Q90W05 


Q90w05 sparus aura 


40 


202 .5 


29. 


, 6 


374 


13 


Q8JFS0 


Q8jfs0 brachydanio 


41 


201 


29 . 


3 


119 


6 


Q95N79 


Q95n7 9 ailuropoda 


42 


201 


29. 


3 


119 


6 


Q95KP2 


Q95kp2 ursus malay 


43 


200.5 


29. 


3 


373 


13 


Q90ZD2 


Q90zd2 oncorhynchu 


44 


200.5 


29. 


3 


373 


13 


Q90ZD1 


Q9 0zdl oncorhynchu 


45 


200.5 


29. 


3 


373 


13 


Q9DDI8 


Q9ddi8 salmo salar 



ALIGNMENTS 



RESULT 1 
002730 

ID 002730 PRELIMINARY; PRT; 112 AA. 

AC 002730; 097501; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01 -JUL- 1997 (TrEMBLrel . 04, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA- 1. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebra ta; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha ; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. 

RA Taylor T.K., James E.R., McGonigle S., Yoho E.R.; 

RL Submitted (APR- 19 97) to the EMBL/ GenBank/ DDB J databases. 



RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K. , Kawabe Y. , Kodama T. ; 

RL Submitted (NOV- 1998 ) to the EMBL/GenBank / DDB J databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER , DISULFIDE -LINKED (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF000133; AAB53806.1; 

DR EMBL; AB020217; BAA36950.1; 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; I PRO 0183 9; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NON_TER 


1 


1 




FT 


CHAIN 


1 


112 


TRANSFORMING GROWTH FACTOR BETA 1 


FT 


DISULFID 


7 


16 


BY SIMILARITY. 


FT 


DISULFID 


15 


78 


BY SIMILARITY. 


FT 


DISULFID 


44 


109 


BY SIMILARITY. 


FT 


DISULFID 


48 


111 


BY SIMILARITY. 


FT 


DISULFID 


77 


77 


INTERCHAIN (BY SIMILARITY) . 


FT 


CONFLICT 


2 


3 


LD -> FS (IN REF. 2) . 


FT 


CONFLICT 


85 


92 


PLPIVYYV -> ATAHRVTTL (IN REF. 2) 


SQ 


SEQUENCE 


112 AA; 


12795 


MW; 53C5B7D46355A6F3 CRC64 ; 



Query Match 91.1%; Score 624; DB 6; Length 112; 

Best Local Similarity 93.3%; Pred. No. 4.6e-65; 

Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

Illllllllll lllllll Ml MIlIII lllilll.lllillll III 

Db 1 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

lllllillllllllllMMIIIIMIIIMIIIIMIIMMIIIIMIIIIIIIMII 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRSCKCS 112 



RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT; 368 AA . 

AC Q8R4D9; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 
DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 
DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 
DE Transforming growth factor beta-1 protein (Fragment) . 
GN TGFB1 . 

OS Sigmodon hispidus (Hispid cotton rat) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Mundae; Sigmodont mae ; 

OC Sigmodon. 

OX NCBI_TaxID=42415; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Blanco J.C., Pletneva L.M., Prince G.A.; 

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

CC -i- SIMILARITY ; BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF480858; AAL87199.1; 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT N0N_TER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C9 12 07B04 68B4A CRC64 ; 

Query Match 90.7%; Score 621; DB 11; Length 368; 

Best Local Similarity 92.5%; Pred . No. 4e-64; 

Matches 111; Conservative 1; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 r I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 257 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 3 08 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 1 1 1 1 1 1 ^ 1 1 1 1 1 i 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 309 SLDTQYSKVLALYNQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 368 



RESULT 3 
Q9TUM8 

ID Q9TUM8 PRELIMINARY; PRT; 3 90 AA. 

AC Q9TUM8 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

GN TGFB1 . 

OS Equus cabal lus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata,- Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Nixon A.J., Brower-Toland B.T., Sandell L.J.; 

RT "Molecular cloning of equine transforming growth factor beta 1 reveals 

RT a divergent nucleotide structure that encodes a novel bioactive 

RT peptide among mammalian species."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 



DR EMBL; AF175709; AAD49431.1; 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 9 11; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

SQ SEQUENCE 390 AA; 43860 MW; 220FE4 0DFCCA601 6 CRC64; 

Query Match 89.3%; Score 612; DB 6; Length 390; 

Best Local Similarity 91.7%; Pred. No. 4.8e-63; 

Matches 110; Conservative 0; Mismatches 2; Intfel s ft; n^pc i ; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIIIM Ml MIIMIMIIIIIIIMIMIIMIIIMIIMMIIMI 

Db 279 ALDTNYCSSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 330 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 120 

1 1 1 1 1 1, 1 1 1 1 1 II I II, 1 1 1 1 1 il I ! , MIIIIMIIIIIIIIIMIIIIIIIMII 

Db 331 SLDTQYSKVLALYNQHNPGASAAPCCVPQVLEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 4 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA . 

AC Q08714; 070331; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI_TaxID=1003 6; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=LVG (SYR) ; 

RX MEDLINE=93304479; PubMed=8 3 17544 ; 

RA Wong D.T., Donoff R.B., Yang J., Song B.Z., Matossian K. , Nagura N. , 

RA Elovic A., McBride J., Gallagher G. , Todd R. ; 

RT "Sequential expression of transforming growth factors alpha and beta 1 

RT by eosinophils during cutaneous wound healing in the hamster. "; 

RL Am. J. Pathol. 143:130-142(1993). 

RN [2] 

RP SEQUENCE OF 26-115 FROM N . A . 

RC STRAIN=SYRIAN; TI SSUE=SPLEEN ; 

RX MEDLINE=98234044 ; PubMed=9573 100 ; 

RA Melby P.C., Tryon V.V., Chandrasekar B . , Freeman G.L.; 

RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and 



RT analysis of cytokine mRNA expression in experimental visceral 

RT leishmaniasis."; 

RL Infect. Immun. 66:2135-2142(1998). 

CC -!- FUNCTION: TGF-BETA 1 ISA MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, DI SULFIDE -LINKED . 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X60296; CAA42838.1; -. 

DR EMBL; AF046214; AAC40099.1; 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204 : TGFB: 1. 

DR PROSITE; PS0 02 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 

FT NON_TER 1 1 

FT PROPEP <1 18 

FT CHAIN 19 130 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 25 34 BY SIMILARITY. 

FT DISULFID 33 96 BY SIMILARITY. 

FT DISULFID 66 129 BY SIMILARITY. 

FT DISULFID 95 95 INTERCHAIN (BY SIMILARITY) . 

FT CONFLICT 93 93 G - > S (IN REF . 2) . 

SQ SEQUENCE 130 AA; 14997 MW; 8B4 1DD6CF3 9CCA77 CRC64 ; 

Query Match 88.9%; Score 60 9; DB 11; Length 13 0; 

Best Local Similarity 91.7%; Pred. No. 3.1e-63; 

Matches 110; Conservative 0; Mismatches 2; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

IIIIIMIIII 1 1 II 1 1 1 1! I M II 1 1 1 M; II 1 1 1 1 1 Ml I h I M 1 1 II 

Db 19 ALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 70 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I M 1 M 1 1 ! 1 1 II II II M II I 1 1 1 1 1 1 1 1 1 1 M M 1 11 1 M M M 1 M 1 1 M M Ml 

Db 71 SLDTQYSKVLALYNQHNPGASAGPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSYKCS 130 

RESULT 5 
Q95N80 

ID Q95N80 PRELIMINARY; PRT; 124 AA . 

AC Q95N80; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. 



RA Fonfara S., Groene A., Baumgaertner W. ; 

RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82- 

RT cells."; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF349538; AAK54072.1; 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta ; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

FT NONJTER 124 124 

SQ SEQUENCE 124 AA; 14329 MW; 21D185218E5556DB CRC64 ; 

Query Match 88.5%; Score 606; DB 6; Length 124; 

Best Local Similarity 93.2%; Pred . No. 6.6e-63; 

Matches 109; Conservative 0; Mismatches 0; Tndels ft; Gape; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 6 0 

IMMIIIIII IIMIIIIIIIMIIIMMIIiiMIIIMIIIIIIIMI 

Db 16 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI W 67 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSC 117 

IIIIIIIIIMIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIII 
Db 68 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSC 124 

RESULT 6 
Q9R184 

ID Q9R184 PRELIMINARY; PRT; 101 AA . 

AC Q9R184; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13 , Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA. 

OS Meriones unguiculatus (Mongolian jird). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Gerbillinae; 

OC Meriones. 

OX NCBI_TaxID=10047; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=LYMPH NODE; 

RA Rao U.R. , Klei T.R. ; 

RT "cDNA cloning of gerbil transforming growth factor-beta by PCR . " ; 

RL Submitted (JUN-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -I- SUBUNIT: HOMODIMER, DISULFIDE -LINKED (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF161218; AAD45726.1; 



DR HSSP; P01137; 1KIA. 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NON_TER 


1 


1 




FT 


CHAIN 


<1 


>101 


TRANSFORMING GROWTH FACTOR 


FT 


DISULFID 


1 


10 


BY SIMILARITY. 


FT 


DISULFID 


9 


72 


BY SIMILARITY. 


FT 


DISULFID 


71 


71 


INTERCHAIN (BY SIMILARITY) 


FT 


NONJTER 


101 


101 




SQ 


SEQUENCE 


101 AA; 


11724 


MW; ABF1CFDA2 64AEFED CRC64 ; 



Query Match 78.8%; Score 540; DB 11; Length 101; 

Best Local Similarity 89.0%; Pred. No. 2.7e-55; 

Matches 97; Conservative 1; Mismatches 3; Indels ft; n^n^ i 
Qy 7 CFSSTDYKDDDDKEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQY 66 

mil iiiiiiiiii j 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 r 1 1 r 1 1 1 1 r 1 1 1 r i in 

Db 1 CFSST EKNCCVRQLYRDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLHTQY 52 

Qy 67 SKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVR 115 

1 1 II 1 1 1 1 II 1 1 1 1 1 h 1 1 1 M M 1 1 II II 1 1 1 II I ! 1 1 1 1 1 1 1 M 1 1 

Db 53 SKVLALYNQHNPGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMFW 101 



RESULT 7 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA. 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 3 (Fragment) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R . ; 

RL Submitted (MAR-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; -. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI: 98727; Tgfb3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFbjpropept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PRO 1423; TGFB ETA . 



DR ProDom; PD0003 57; TGFb; 1. 

DR SMART ; SM002 04; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 362 AA; 41486 MW; 08 08E461 8 0FDAE70 CRC64 ; 



Query Match 70.9%; Score 486; DB 11; Length 362; 

Best Local Similarity 71.7%; Pred. No. 2.3e-48; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

MINIM = MM MM I M M MM M M MM M M MM M I MMM 

Db 251 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 302 

Qy 61 SLDTQYSKVT.ALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I M M M Ml M M M II I II I MM MM 1 1 II M I M M I MM M I M I 

Db 3 03 SADTTHSTVLGLYNTLNPEASAS PCCVPQDLE PLTI LYYVGRTPKVEQLSNM WKSCKCS 362 



RESULT 8 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA. 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor, beta 3. 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/ DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC014690; AAH14690.1; -. 

DR MGD; MGI: 98727; Tgfb3 . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 9 11; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PRO 14 23; TGFB ETA . 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 412 AA; 47144 MW; F3EB65D04 6DF32AD CRC64 ; 

Query Match 70.9%; Score 486; DB 11; Length 412; 

Best Local Similarity 71.7%; Pred. No. 2.6e-48; 

Matches 86; Conservative 11; Mismatches 15; Indels 8; Gaps 1; 



Qy 



1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 



II I Mill hlllll MlllhlMMhMMIhMM III II-. 

Db 301 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLR 3 52 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I II M II III II llhllllll I I I I hlllll llllllllhhMlil 
Db 3 53 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 

RESULT 9 
093449 

ID 093449 PRELIMINARY; PRT; 382 AA . 

AC 093449; Q91217; 

DT 01-NOV-1998 (TrEMBLrel . 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA OR TGF . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo qairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; 

0C Protacanthopterygii ; Salmoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8 022; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE= 9 92 4 2020; PubMed= 10227481; 

RA Daniels G.D., Secombes C.J.; 

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA . " ; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE=98390168; PubMed=9722928 ; 

RA Hardie L.J., Laing K.J., Daniels G.D., Grabowski P.S., Cunningham C. , 

RA Secombes C.J.; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE . MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE- LINKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AJ007836; CAA07707.1; -. 

DR EMBL; X99303; CAA67685.1; 

UK HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 



DR 


ProDom; PD000357; 


TGFb; 


1 . 




DR 


SMART; SM002 04; TGFB; 1 






DR 


PROSITE; 


PS00250; 


TGF BETA_1; 


1 . 


KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 


FT 


S I GNAL 


1 


20 




POTENTIAL . 


FT 


PROPEP 


2 1 


270 






FT 


CHAIN 


271 


"5 O 

382 




I KANSr URMINCj LjRUWIH FAL1UR BE1A. 


FT 


DISULFID 


278 


286 




BY SIMILARITY. 


FT 


DISULFID 


285 


348 




BY SIMILARITY. 


FT 


DISULFID 


J 14 


379 




DV CTMTT ADTTV 
DI o 1 1Y1 1 J_iAK lit. 


FT 


DISULFID 


318 


381 




BY SIMILARITY. 


FT 


DISULFID 


347 


347 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 




N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


116 


116 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CONFLICT 


237 


237 




N -> D (IN REF. 2) . 


FT 


CONFLICT 


345 


345 




Q -> H (IN REF . 2) . 


FT 


CONFLICT 


371 


372 




LS -> VP (IN REF. 2) . 


FT 


CONFLICT 


377 


377 




K -> M (IN REF . 2) . 


SQ 


SEQUENCE 


382 AA; 


44136 MW; 


93BD4D3540084B92 CRC64 ; 


Query Match 




69 


8%; 


Score 478; DB 13; Length 382; 



Best Local Similarity 76.7%; Pred. No. 2.1e-47; 

Matches 79; Conservative 13; Mismatches 11; Indels 0; Gaps 



0 ; 



Qy 
Db 



18 DKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 77 

II ::||||:|IIIIIIIIIIMIIII I I IhhIII llh = Uhlllll I I 
28 0 DKSESCCVRKLYIDFRKDLGWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHHN 33 9 



Qy 

Db 



78 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

Mill MINI lllllhllllh I M 1 III I Ih ! hi , 
340 PGASAQPCCVPQVLEPLP I I Y Y VGRQHKVEQLSNM I VKSCRCS 382 



RESULT 10 
Q921T1 

ID Q921T1 PRELIMINARY; PRT; 255 AA. 

AC Q921T1; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=1009 0; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R . ; 

RL Submitted (JUL-2001) to the EMBL/ GenBank / DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF -BETA FAMILY. 

DR EMBL; BC011055; AAH11055.1; -. 

DR MGD; MGI: 98 72 6; Tgfb2 . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 



DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0 00357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 255 AA; 29087 MW; 94 54 00 17F3C5C2 1 9 CRC64 ; 

Query Match 68.0%; Score 466; DB 11; Length 255; 

Best Local Similarity 66.7%; Pred . No. 3.3e-46; 

Matches 80; Conservative 15; Mismatches 17; Indels 8; Gaps 1; 
Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 60 

Ml Ml •• = MIM MMhMMMMMMMMMI MUM 

Db 144 ALDAAYCFRNV QDNCCLRPLYIDFKRDLGWKWI HEPKGYNANFCAGACPYLW 195 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I M MM MM II II MIM MM MM 1 = 11 = 1 MM MM MIM I III 

Db 196 SSDTQHTKVLSLYNTI NPEASASPCCVSQDLEPLTI LYYI GNTPKI EQLSNMI VKSCKCS 2 55 



RESULT 11 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA. 

AC Q91VP5; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Breast tumor; 

RA Strausberg R. ; 

RL Submitted (JUL-2001) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC011170; AAH11170.1; -. 

DR MGD; MGI : 98726 ; Tgf b2 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR0 03911; TGF TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM0 02 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 414 AA; 47588 MW; DB37A7C3888 1F286 CRC64 ; 



Query Match 68.0%; Score 466; DB 11; Length 414; 

Best Local Similarity 66.7%; Pred. No. 5.7e-46; 

Matches 80; Conservative 15; Mismatches 17; Indels 8; Gaps 1; 



Qy 1 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIW 6 0 

Ml III = : llhl lillhhllM hlUMIII I 'Ihl 

Db 303 ALDAAYCFRNV QDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLW 354 

Qy 61 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

Mlhhlhlll II II hill I I MM hi hi Ihl MINI hi I II I 

Db 355 SSDTQHTKVLSLYNTI NPEASASPCCVSQDLEPLTI LYYIGNTPKI EQLSNMI VKSCKCS 414 

RESULT 12 
Q90YF1 

ID Q90YF1 PRELIMINARY; PRT; 200 AA . 

AC Q90YF1; 

DT 01-DEC-2001 (TrEMBLrel . 19. Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

GN TGF-BETA1. 

OS Pleuronectes platessa (Plaice) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; PI euronecti formes ; 

OC Pleuronectoidei ; Pleuronect idae ; Pleuronectes. 

OX NCBI_TaxID=8262 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Laing K.J. ; 

RL Submitted (AUG-2001) to the EMBL/ GenBank / DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20394651; PubMed=10938738 ; 

RA Laing K.J., Cunningham C. , Secombes C.J.; 

RT "Genes for three different isoforms of transforming growth factor-beta 

RT are present in plaice (Pleuronectes platessa) DNA." ; 

RL Fish and Shellfish Immunol. 10:261-271(2000). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AJ318933; CAC60268 . 1 ; -. 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PRO 0438; GFCYSKNOT. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT N0N_TER 1 1 

FT N0N_TER 200 2 00 

SQ SEQUENCE 200 AA; 22851 MW; 4876FEB6A263B4CC CRC64 ; 

Query Match 67.8%; Score 464.5; DB 13; Length 200; 

Best Local Similarity 67.5%; Pred. No. 3.8e-46; 

Matches 79; Conservative 15; Mismatches 14; Indels 9; Gaps 1; 
Qy 4 TNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 63 



Db 93 TDTCTAQTE TCCVRKLY I DFRKDLGWKW I HKPTGYHANYCMGSCTY I WNAE 143 

Qy 64 TQY S KVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGRKP KVEQLSNM I VRSCKCS 12 0 

lllllll I I I I I I I I I I I I M II II h IIIIIIM MINI 
Db 144 NKYSQI LALYKHHNPGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMS VKSCKCS 200 



RESULT 13 
Q8JHF5 

ID Q8JHF5 PRELIMINARY; PRT; 379 AA. 

AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata: Vertebrata; Fur.pl pnsr nmi ; 

OC Act inopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Perci formes; Percoidei; 

OC Sparidae; Sparus. 

OX NCBI_TaxID=8175; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Tafalla C. , Aranguren R. , Secombes C.J., Castrillo J.L., Novoa B., 

RA Figueras A. ; 

RT "Molecular characterization of sea bream (Sparus aurata) transforming 

RT growth factor betal."; 

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF510084; AAN03842.1; -. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003 911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide ; 2. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

SQ SEQUENCE 379 AA; 43506 MW; C0C9D3D2FCA2 9C0E CRC64 ; 

Query Match 67,7%; Score 463.5; DB 13; Length 379; 

Best Local Similarity 71.4%; Pred. No. le-45; 

Matches 80; Conservative 14; Mismatches 17; Indels 1; Gaps 1 

Qy 10 STDYKDD-DDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 68 

Ih M = = Mil II M II I M I M II I • I I I I I : I : I I MM : MM 

Db 268 STETKDTCTAQTETCC VRSLY I DFRKDLGWKWI HKPTRYHANYCMGSCTY I WNAENKYSQ 32 7 

Qy 69 VLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKP KVEQLSNM I VRS CKCS 120 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 ^ IMIMMMMMM 

Db 328 ILALYKHHNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMI VKSCKCS 379 



RESULT 14 
Q9PWA9 

ID Q9PWA9 PRELIMINARY ; PRT; 382 AA . 

AC Q9PWA9; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel . 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA. 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha ; Perci formes ; Percoidei; 

OC Moronidae; Morone. 

OX NCBI_TaxID=45352; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=KIDNEY; 

RX MEDLINE=20394636; PubMed=10938723 ; 

RA Harms C.A., Kennedy-Stoskopf S., Home W.A., Fuller F.J., 

RA Tompkins W.A.F.; 

RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M. 

RT chrysops) transforming growth factor-beta (TGF-beta) , and development 

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 
CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS . 

CC SUBUNIT: HOMODIMER, D I SULFIDE -LINKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 
CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 



CC 


-!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 






DR 


EMBL; AF140363; AAD46997.1; -. 






DR 


HSSP; P01137; 1KLA . 






DR 


InterPro; IPR002400; GF_cysknot . 






DR 


InterPro; I PRO 0183 9; TGFb. 






DR 


InterPro; IPR001111; TGFb N. 






DR 


InterPro; IPR003911; TGF_TGFb. 






DR 


Pfam; PF00019; TGF-beta; 1. 






DR 


Pfam; PF00688; TGFb propeptide; 1. 






DR 


PRINTS; PR00438; GFCYSKNOT. 






DR 


PRINTS; PR01423; TGFBETA . 






DR 


ProDom; PD000357; TGFb; 1. 






DR 


SMART; SM00204; TGFB; 1. 






DR 


PROSITE; PS00250; TGF_BETA__1 ; 1. 






KW 


Growth factor; Mitogen; Glycoprotein,- Signal. 






FT 


SIGNAL 1 ? POTENTIAL . 






FT 


PROPEP ? 270 






FT 


CHAIN 271 382 TRANSFORMING GROWTH 


FACTOR BETA. 


FT 


DISULFID 278 286 BY SIMILARITY. 






FT 


DISULFID 285 348 BY SIMILARITY. 






FT 


DISULFID 314 379 BY SIMILARITY. 






FT 


DISULFID 318 381 BY SIMILARITY. 






FT 


DISULFID 347 347 INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 73 73 N-LINKED (GLCNAC. . 


.) 


(POTENTIAL) 


FT 


CARBOHYD 108 108 N-LINKED (GLCNAC . 


.) 


(POTENTIAL) 


FT 


CARBOHYD 113 113 N-LINKED (GLCNAC. . 


.) 


(POTENTIAL) 



FT CARBOHYD 124 124 N-LINKED (GLCNAC . . .) (POTENTIAL) 

FT CARBOHYD 259 259 N-LINKED (GLCNAC. . .) (POTENTIAL) 

FT SITE 235 237 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 382 AA; 43846 MW ; 3 124D8C34EA74D72 CRC64 ; 



Query Match 67.5%; Score 4 62.5; DB 13; Length 382; 

Best Local Similarity 71.4%; Pred. No. 1.3e-45; 

Matches 80; Conservative 15; Mismatches 16; Indels 1; 



Gaps 



1; 



Qy 
Db 

Qy 
Db 



10 STDYKD-DDDKEKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 68 

II : II : : I I I I I M I I I I I I I I I I I h I Ihlhhl I Mh = =lh 

271 STETKDVCTAQTETCCVRSLY I DFRKDLGWKW I HKPTGYNAN YCMGSCTY I WNAENKYSQ 33 0 

6 9 VLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

:|||| lllllll ' I I I I II I II I ' h I I Ml Mlllllllhlllll 
331 ILALYKHHNPGASAQPCCVPQALEPLPI LYYVGRQHKVEQLSNMIVKSCKCS 382 



RESULT 15 
Q8AXK8 

ID Q8AXK8 PRELIMINARY; PRT; 379 AA . 

AC Q8AXK8 ; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actmopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha ; Perci formes; Percoidei; 

OC Sparidae; Sparus. 

OX NCBIJTaxID=8175; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Tafalla C. , Novoa B., Aranguren R. , Figueras A. ; 

RT "Molecular cloning and characterization of sea bream (Sparus aurata) 

RT TGF beta 1 . " ; 

RL Submitted (SEP-2001) to the EMBL / GenBank / DDB J databases. 

DR EMBL; AF424703; AAN76665.1; -. 

SQ SEQUENCE 379 AA; 43485 MW; A306EC387F6DBA7C CRC64 ; 

Query Match 67.1%; Score 459.5; DB 13; Length 379; 

Best Local Similarity 71.4%; Pred. No. 3e-45; 

Matches 80; Conservative 13; Mismatches 18; Indels 1; Gaps 1; 

Qy 10 STDYKDD-DDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 68 

II: II : : MM I I I I I I I I I I I I I I h I I M M I M I MM : MM 

Db 268 STETKDTCTAQTETCCVR SLY I DFRKDLGWKW I HKPTRYHAN YCMGSCTY I WNAENKYSQ 327 

Qy 6 9 VLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

MM lllllll I I I I 1 I I f I I I I I ^ I I I I I ^ j I I I I I I I f I I I I I I 
Db 328 XLALYKHHNPGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMIVKSCKCS 379 



Search completed: October 28, 2003, 09:12:25 
Job time : 9.6208 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



nucleic search, using sw model 

October 27, 2003, 10:47:27 ; Search time 5261.34 Seconds 

(without alignments) 
10489.161 Million cell updates/sec 

US-10-017-372E-32 
1349 

1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 134 9 

I DENT I TY_NUC 
Gapop 10.0 , Gapext 1.0 



Searched : 



2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



gb_ba : * 
gb_htg : * 
gb_in: * 
gb_om : * 
gb_ov : * 
gb__pat : * 
gb__ph .• * 
gb_pl : * 
gb_pr : * 
gb_ro : * 
gb_sts : * 
gb_sy : * 
gb_un : * 
gb_vi : * 
em_ba : * 
em__f un : * 
em_hum : * 
em_in : * 
em_mu : * 
em_om : * 
em_or : * 
em_ov : * 
em_pat : * 
em_ph : * 
em_pl : * 
em_ro : * 
em sts:* 



28 


em_un : * 


29 


em vi : * 


30 


em_htg_hum: * 


31 


em_htg_inv : * 


32 


em__htg_other : * 


33 


em htg mus : * 


34 


em_htg_pln : * 


35 


em htg rod: * 


36 


em_htg_mam: * 


37 


em htg vrt : * 


38 


em_sy : * 


39 


em htgo_hum : * 


40 


em_htgo_mus : * 


41 


em htgo other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length DB 


ID 


Description 


1 


1233 . 


2 


91 


4 


3206 


4 


PIGTGFB1A 


M23703 Sus scrofa 


2 


1223 . 


8 


90 


7 


1326 


6 


AX338213 


AX338213 Sequence 


3 


1213 . 


8 


90 


0 


1750 


4 


GGTGFB1 


X12373 Porcine mRN 


4 


1143 . 


6 


84 


8 


1605 


4 


SSTGFBR 


Y00111 Porcine mRN 


5 


1122 . 


2 


83 


2 


2221 


4 


AF461808 


AF461808 Sus scrof 


6 


1002 . 


6 


74 


3 


1369 


4 


DOGTGFB1A 


L34956 Canine tran 


7 


991 . 


6 


73 


5 


2527 


6 


E00973 


E00973 cDNA encodi 


8 


990. 


8 


73 


4 


2537 


6 


A06669 


A06669 Synthetic m 


9 


984. 


8 


73 


0 


1173 


4 


OATGFB1 


X76916 O.aries mRN 


10 


984 . 


4 


73 


0 


1780 


9 


BC000125 


BC000125 Homo sapi 


11 


984 . 


4 


73 


0 


1780 


9 


BC001180 


BC001180 Homo sapi 


12 


983 


72 


9 


1561 


9 


AGMTGFB 


M16658 Simian tran 


13 


980. 


2 


72 


7 


1821 


6 


E03028 


E03 028 DNA encodin 


14 


979. 


8 


72 


6 


1746 


9 


BC022242 


BC022242 Homo sapi 


15 


978 . 


6 


72 


5 


1560 


6 


106216 


106216 Sequence 2 


16 


977 . 


8 


72 


5 


2745 


9 


HSTGFB1 


X02 812 Human mRNA 


17 


976. 


6 


72 


4 


1560 


6 


108268 


108268 Sequence 2 


18 


959. 


2 


71 


1 


1569 


6 


106221 


106221 Sequence 3 


19 


954 . 


2 


70 


7 


1561 


6 


108275 


108275 Sequence 3 


20 


939. 


6 


69 


7 


1173 


9 


BT007245 


BT007245 Homo sapi 


21 


939. 


6 


69 


7 


1173 


12 


BT007866 


BT007866 Synthetic 


22 


930. 


4 


69 


0 


1173 


4 


ECRGFB1 


X99438 E.caballus 


23 


930. 


2 


69 


0 


1176 


6 


AX615127 


AX615127 Sequence 


24 


928 . 


6 


68 


8 


1176 


6 


AX481432 


AX481432 Sequence 


25 


928 . 


6 


68 


8 


1176 


6 


AX615128 


AX615128 Sequence 


26 


924 


68 


5 


1187 


4 


AF175709 


AF175709 Equus cab 


27 


901 . 


2 


66 


8 


1597 


10 


AF191297 


AF191297 Cavia por 


28 


872 . 


8 


64 


7 


1641 


10 


BC013738 


BC013738 Mus muscu 


29 


859 


63 


7 


1579 


10 


MUSTGFRNA 


M13177 Mouse trans 


30 


859 


63 


7 


2094 


10 


MMU009862 


AJ009862 Mus muscu 


31 


839. 


4 


62 


2 


1585 


10 


RNTGFB1 


X52498 Rat mRNA fo 


32 


827. 


6 


61 


3 


1117 


4 


BOVTGFB 


M36271 Bovine tran 


33 


803 


59 


5 


1125 


10 


AF480858 


AF4 8 08 58 Sigmodon 



34 


682. 


.2 


50 . 


. 6 


1675 


6 


I 033 1U 


103310 Sequence 1 


3 b 


654 


4 8 . 


. 5 


13 lb 


6 


Aadzod 


AAbz8b33 sequence 


36 


652 . 


. 4 


4 8 


. 4 


138 9 


6 


Aadzo d i y 


AX528619 Sequence 


o / 


591 . 


,4 




o 

. o 




c 

D 


rvA.D Z. O D J D 




38 


586. 


. 6 


43 


.5 


1350 


6 


AX528615 


AX528615 Sequence 


39 


555. 


. 6 


41 


. 2 


699 


6 


105434 


105434 Sequence 4 


40 


366. 


.4 


27. 


.2 


489 


6 


AX455100 


AX455100 Sequence 


41 


357. 


, 6 


26. 


.5 


650 


6 


AX336646 


AX336646 Sequence 


42 


357. 


6 


26. 


.5 


650 


9 


HUMTGFBA 


M38449 Homo sapien 


43 


357 . 


.6 


26 


. 5 


862 


6 


103312 


103 312 Sequence 3 


44 


339. 


.2 


25. 


. 1 


469 


10 


MATGFB1 


X60296 M.auratus m 


45 


331. 


.8 


24 . 


.6 


1256 


5 


CHKTGFB4 


M31160 Gallus gall 



ALIGNMENTS 



RESULT 1 

PIGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



gene 
CDS 



PIGTGFB1A 3206 bp mRNA linear MAM 31-MAR-1995 

Sus scrofa transforming growth factor beta-1 mRNA, complete cds . 
M23703 

M23703 .1 GI : 755044 
transforming growth factor-beta - 1 . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ,• 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
1 (bases 1 to 3206) 

Kondaiah,P., Van Obberghen-Schill ing, E . , Ludwig,R.L., Dhar,R., 
Sporn,M.B. and Roberts, A. B. 

cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

Evidence for alternate splicing and polyadenylation 

J. Biol. Chem. 263 (34), 18313-18317 (1988) 

89054010 

2461367 

On Apr 1, 1995 this sequence version replaced gi: 341017. 
Original source text: Sus scrofa (strain miniature swine) cDNA to 
mRNA. 

Locat ion/Qua li f iers 
1. .3206 

/organism="Sus scrofa" 
/mol_type= n mRNA M 
/strain= "miniature swine" 
/db_xref ="taxon: 9823" 

/cell_type= n peripheral blood lymphocyte" 
1. .32 06 

/gene="TGF-beta-l" 
906. .2078 
/gene="TGF-beta-l" 
/codon_start=l 

/product = •• transforming growth factor-beta - 1 " 
/protein_id="AAA64616 . 1" 
/db_xref="GI :755045" 

/ 1 rans la t ion= " M PPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 



polyA_signal 
polyA_site 



BASE COUNT 
ORIGIN 



645 a 



KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFR 

LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 

RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 

QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 

3186. .3191 

/gene=" TGF-beta-l M 

3206 

/gene="TGF-beta-l" 

1041 c 924 g 596 t 



Query Match 91.4%; Score 1233.2; DB 4, 

Best Local Similarity 96.1%; Pred . No. 3.7e-208; 
Matches 1290; Conservative 0; Mismatches 13; 



Length 3206; 
Indels 39; Gaps 



1; 



Qy 


6 


Db 


901 


Qy 


66 


Db 


961 


Qy 


126 


Db 


1021 


Qy 


186 


Db 


1081 


Qy 


246 


Db 


1141 


Qy 


306 


Db 


1201 


Qy 


366 


Db 


1261 


Qy 


426 


Db 


1321 


Qy 


486 


Db 


1381 


Qy 


546 


Db 


1441 


Qy 


606 



CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II III M 1 1 1 1 1 1 1 1 II 1 1 h 1 1 1. I I 1 1 1 1 1 U: 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 



65 



960 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

IMIIIIIIIIIIIIMMIIIIIIIIIIMMMIIMIIIMMIIIIIMIIIIIII 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 1020 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 ; I i I'l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 II : M 1 1 1 1 1 II I M I M II I li I II 1 1 1 1! 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 108 0 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 245 

I I ' I I I 'L' I I I , i I I |, I'M 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 114 0 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 12 00 
CAGAGGCGGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGGAACC 3 65 

i ' ': m 1 1 1 ' 1 , i 1 h , 1 1 1 1 1 , 1 1, : ii i , 1 'i i 'i 1 1 1 1 1 , ii i ill ii ii iii 

C^GAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 1260 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCT^ 425 

llllll llllllll Mlllllllllllllllll Mill IIIIIIIIMI lllllllllll 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 132 0 



MM llllllll llllllll MM lllllll I 



Mill 



Ml 



MM llllllll llllllll lllllllllll Mill MM lllllll lllllllllll 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 



MINI 



Db 



1501 



ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1560 



Qy 


666 


Db 


1561 


Qy 


726 


Db 


1621 


Qy 


786 


Db 


1681 


Qy 


846 


Db 


1740 


Qy 


906 


Db 


1762 


Qy 


966 


Db 


1822 


Qy 


1026 


Db 


1882 


Qy 


1086 


Db 


1942 


Qy 


1146 


Db 


2002 


Qy 


1206 


Db 


2062 


Qy 


1266 


Db 


2122 


Qy 


1326 


Db 


2182 



1 I I , I I I I I I I I I M, I I I M .'I I I I I I I I I : I I I II I I I I II I ; I I i: '! I I I, I I 



MMII 



1 1 1 1 1 1 II M 1 1! 1 1 1 1 1 'i 1 1 II ,< 1 1 M 1 M 1 1 1 1 1 1 1 1 > 1 1 1 



MIMIIM 



I II I 



I I I I I I I M I I I I, 'I I II ; I I I I,', I I I I i I I I I ! ' II I I,' I I I I I: I 



TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

1 1 1 , i 1 1 1 1 : 1 1 1 1 M 1 1 II , 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M , I M II . I INI 1 1 1 1 

TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1881 

GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

1 1 1 ' 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 'I I N , 1 1 II, , 1 1 1 1 1 1 1 1 1 1 'II 1 1, 1 1 1 1 1, II I II 

GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1941 

ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

II I . I I I I I ' I I I I I, I I I M I I I I I I I I I I I I I I Ml ' II ll I I II II; I I I I 1 1 I I I i 

ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 2 0 01 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

ii 1 1 'i 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 , 1 1 1 1 1 , 1 1 1 1: 1 1 1 1 , 1 1 1 ii 1 1 ii 'i 1 1 1 1 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 2 061 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIII 



1 1 1 1 



II . II 1, 1 1 II 1 1 M 1 1 



III Mill III I 



II 



RESULT 2 
AX338213 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 



AX338213 
Sequence 
AX338213 
AX338213 . 



1326 bp 
1 from Patent WO0181404. 

1 GI:18128750 



DNA 



linear PAT 09-JAN-2002 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

1 

Strober,W., Nakamura,K., Kitani,A. and Fuss, I. J. 

Inducible plasmid vector encoding tgf-_g(b) and uses thereof 

Patent: WO 0181404-A 1 01-NOV-2001; 

THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US) 
Location/ Qua 1 i f iers 
1. -1326 

/ organism= " Sus scrofa 11 
/mol_type= "genomic DNA" 
/db_xref = 11 1 axon: 9823" 
16. .1188 

/note= "unnamed protein product" 
/codon_start=l 
/protein_id="CAD20538 . 1" 
/db_xref = "GI : 18128751" 

/ 1 rans 1 a t i on= " MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEI NGFNSGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVTALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
263 a 438 c 392 g 233 t 



Query Match 90 . 7%; 

Best Local Similarity 96.4%; 
Matches 1299; Conservative 



Score 1223.8; DB 6; 
Pred. No. 1.9e-206; 
0; Mismatches 7; 



Length 132 6; 
Indels 41; Gaps 



Qy 


i 


Db 


6 


Qy 


61 


Db 


66 


Qy 


121 


Db 


126 


Qy 


181 


Db 


186 


Qy 


241 


Db 


246 


Qy 


301 


Db 


306 



TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 60 

Mill. M ' 1 1 1 1 ' 1 1 1 1 , I ! ' 1 1 , . ' 1 1 1 , , l j 1 1 

TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 6 5 
GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 0 

Ml MM ILIIIMIIIIIILIIIIIII lllllllllllllirillllllllil 

GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 5 
CATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCT 18 0 

MMMMMMMMMMMMMMMMMMMIMMMMMMMMMMI 

CATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCT 185 
TCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGT 24 0 

I M M I 1 1 1 1 1 1 1 1 M 1 1 M M M ! ! ! ! ! ! M M ! M ! M M ! ! M M ! ! ! ! ! M ! M 

TCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGT 24 5 
ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 0 0 

MMMMMMMM MiMMMMMMMMMMMMMM MMMMMI 

ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 0 5 
CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 3 60 

IMMMMMMMMMMMMMMM Ml MMMM M MMMMMMMM 

CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 365 



Qy 


361 


Db 


366 


Qy 


421 


Db 


426 


Qy 


481 


Db 


486 


Qy 


541 


Db 


546 


Qy 


601 


Db 


606 


Qy 


661 


Db 


666 


Qy 


721 


Db 


726 


Qy 


781 


Db 


786 


Qy 


841 


Db 


846 


Qy 


901 


Db 


867 


Qy 


961 


Db 


927 


Qy 


1021 


Db 


987 


Qy 


1081 


Db 


1047 


Qy 


1141 


Db 


1107 



CAACCAAATCTATGATAAATTCAAGGGCACCCCCCA 420 

! MINIM I II III ML I II IIMIIM I II III III Ml Mill I MM 1 1! II I 

CAACCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACA^ 425 
GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 8 0 

1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! I IIIMI ! 1 1 1 1 1 1 1 I ! 1 1 

GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 4 85 
GCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGA 540 

I IMIIMI MMMMMMMMMMMMMMMMMMMMMMIMM 

GCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGA 54 5 
TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 600 

1 1 1 1 1 1 1 1 i 1 1 1 1 J 1 1 1 1 1 : 1 III] ! 1 1 1 [ I ! 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 [ I i 1 1 1 ! 1 1 

TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 6 05 
CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 6 6 0 

1 1 1 1 1 1 1 j i r j i j 1 1 1 1 1 r 1 1 1 f 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 

CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 665 
TCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 72 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

TCGCCTCAGTGCCCACTCTTCCTCTGAC^GCAAAGATAACACACTCCACGTGGAAATTAA 725 
CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 780 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 785 
CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 0 

Ml MM III II 1 1 II I M II 1 1 1 1 1 II 1 1 II II I IM I Ml I II I M II 1 1 1 II I III 

CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 5 

CCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTA 900 

I I I I IIIMIIIIIIIIIIII 
CCGA GCCCTGGATACCAACTA 8 66 



1 1 1 1 1 1 I 1 1 ! [ , j 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 . 1 1 1 r 1 1 1 1 1 1 1 1 



GGA C CTGGG CTGGAAGTGGATT CATGAAC C CAAGGG CTAC CATG CCAATTT CTG C CTGGG 102 0 

IIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

GGA C CTGGG CTGGAAGTGGATT CATGAACCCAAGGGCTACCATGCCAATTTCTGC CTGGG 9 86 
GCCCTGTCCCTA(^TCTGGAGCCTAGACACT(^GTA(^GCAAGGTCCTGGCTCTGTAC^A 108 0 

II I III MM II I III I I II 1 1 II III II 1 1 1 1 MM MM II I M I II I II II M Ml 

GCCCTGTCCCTACATCTGGAGCCTAGAC^CTOVGTACAGCAAGGTCCTGGCTCTGTACAA 104 6 
CCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACT 114 0 

III III Mil III III I I II! IMIIIMI I III MMMIMMMMIMM II Ml 

CCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACT 1106 

GCCC^TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGT 12 0 0 

I Mil I I I I I I I I M I i 1 1 1 1 1 I I I I I : I I I I 1 1 1 1 i I : 1 1 1 1 1 1 1 1 1 1 M I 

GCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGT 1166 



Qy 12 01 GCGTTCCTGC^AGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGC 12 6 0 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 'I 1. 1 hll I II IliMMIII II. Ill I MM II M III 

Db 1167 GCGTTCCTGCAAGTGCAGCTGA-GCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGC 1225 

Qy 12 61 CCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCC 132 0 

1 1 1 ' 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 i I ! I ! I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1226 CCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCC 12 8 5 

Qy 1321 A CTTGGG AT CG ATTAAAG CGG C CG CG A 134 7 

1 1 MMMMMMMI I I I I 

Db 12 8 6 AC - TGGGATCGATTAAAGGTGGAGAGA 1311 



RESULT 3 

GGTGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



gene 
5 1 UTR 

CDS 



GGTGFB1 175 0 bp mRNA linear MAM 2 7 -MAR- 1996 

Porcine mRNA for transforming growth factor-beta 1. 

X12373 

X12373.1 GI:63808 

transforming growth factor-beta 1. 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 

1 (bases 1 to 1750) 

Jakowlew, S . B . , Dillard, P. J. , Sporn,M.B. and Roberts, A. B. 
Nucleotide sequence of chicken transforming growth factor-beta 1 
(TGF-beta 1) 

Nucleic Acids Res. 16 (17), 8730 (1988) 

88335639 

3166520 

2 (bases 1 to 1750) 
Jakowlew, S . B . 
Direct Submission 

Submitted ( 14 - JUL-1988 ) Jakowlew S.B., National Institute of 
health, National Cancer Institute, Laboratory of Chemoprevent ion, 
Building 41, Room B902, Bethesda, Maryland 20892, USA 
The submitters believe that the chicken cDNA library was 
contaminated with porcine cDNA, and that the sequence is infact 
porcine TGF-beta-1. 27-MAR-1996. 

Location/Qual if iers 

1. .1750 

/organism="Sus scrofa" 
/mol_type= "mRNA" 
/strain= "white leghorn" 
/db_xref =" taxon: 9823" 
/ c 1 one = " pTGFB - ChX 119" 
/eel l_type=" chondrocyte" 
1. .1750 

/gene=" TGF-beta 1" 
1. .446 

/gene=" TGF-beta 1" 
447. .1622 
/gene="TGF-beta 1" 
/ codon_start=l 

/product=" transforming growth factor" 
/protein_id="CAA30933 .1" 



/db_xref="GI : 63809" 
/db_xref =" SWISS -PROT: P07200 n 

/ trans la t ion= "MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINAGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSR 
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLD 
TQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS" 
mat_peptide 1285. .1619 

/gene="TGF-beta 1" 
3'UTR 1620. .1750 

/gene="TGF-beta 1" 

BASE COUNT 325 a 627 c 479 g 319 t 

ORIGIN 



Query Match 90.0%; Score 1213.8; DB 4; Length 1750; 

Best Local Similarity 95.6%; Pred . No. l.le-204; 

Matches 1286; Conservative 0; Mismatches 17; Indels 42; Gaps 2; 



Qy 


6 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II Ml MINIMIM 1 M 1 1 1 1 M M M 1 M M M M M II II 1 Ml 1 1 M 

CCCCCATGCCGCCTTCGGGGCCTGGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 


65 


Db 


442 


501 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

MUM Ml lllllll M MMMIIMM MIIMI MMM Ml MM 1 MMMM 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 


125 


Db 


502 


561 


Ov 


126 


1 1 1 1 1 1 1 1 1 1 1 1 e e r 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 


lo J 


Db 


562 


621 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 ! 1 1 M i 1 1 1 1 1 1 1 1 1 1 1 1 II ' 1 1 1 1 1 i 1 1 1 . 1 1 1 1 1 1 1 1 1 1 i 1 1 MIMM 

TCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 


245 


Db 


622 


681 


Qy 

Db 


246 
682 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r 1 1 1 1 1 r 1 1 r 1 1 r 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


305 
741 


Qy 

Db 


306 
742 


C^GAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 ! 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 i 1 1 1 1 II 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


365 
801 


Qy 


366 


AAATCTATGATAAATTC^GGGCACCCCCC^C^GCTTATATATGCTGTTCAACACGTCGG 

1 MINI lllllll IMIIIIIIIIIIII MIIMI MINIMIM 1 MINI Mill 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTT 


425 


Db 


802 


861 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

1 M 1 M 1 1 MM 1 M 1 1 1 1 M M 1 M 1 M 1 MM M 1 1 II 1 1 1 1 1 1 1 1 1 M M 1 MM, 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 


Db 


862 


921 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGC^GCACGTGGAGCTATACC^GAAATACAGa^TGATTCCT 

1 1 1 1 1 II 1 1 IM 1 M 1 II 1 1 1 II 1 M M II 1 Ml M 1 1 III M 1 M M III M IMM M 

GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 


Db 


922 


981 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 



II IIMM IMII. Mill llllillllMIIIIMIMIIIII Mill IIMI 

Db 982 GGGGCTACCTCAGC7VACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 1041 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 1 II M M M 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M II M 1 1 

Db 1042 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 1101 
Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAAC G 722 

MMIMMMI IMM M M I M 1 1 1 1 1 M M M II M M I M M 1 1 M M M I 

Db 1102 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGCAG 1161 

Qy 723 GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 782 

MIMMIIIMMIIIMIIIM MM MMMMMMMMMMIMIMM 

Db 1162 GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 1221 

Qy 783 TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 84 2 

1 1 1 ! 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 ; 1 1 : 1 I i I ! I I I I I I 1 1 I I I I 1 1 I I I I I I i 

Db 1222 TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 12 81 

Qy 843 GAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACT 902 

I I II II 1 1 M I Ml 1 1 II II! 

Db 12 82 GA GCCCTGGATACCAACTACT 13 02 

Qy 9 03 GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 962 

I I I i 1 1 1 1 1 : ! 1 1 1 1 IMM 1 1 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 13 03 GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 1362 

Qy 963 ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1022 

I M 1 1 ! 1 1 1 1 : M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 ! 1 1 ^ 1 1 1 1 1 1 1 1 [MM I 

Db 1363 ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1422 

Qy 1023 CCTGTCCCTACATCTGGAGCCTAGAC7VCTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 1082 

I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1423 CCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 14 82 

Qy 1083 AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 1142 

MIMIMI IMIMIMM MMMIMIMMMIMMIMMIMM Mill 

Db 14 83 AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 154 2 

Qy 1143 CC^TCGTGTACTACGTGGGCCGCAAGCCO^GGTGGAGCAGCTGTCCAACATGATCGTGC 12 02 

i MM II II 1 1 1 1 1 II 1 1 M I II 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 III M I II 1 1 M III 

Db 154 3 CC^TCGTGTACTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCC^ACATGATCGTGC 1602 

Qy 12 03 GTTCCTGC^GTGC^GCTGAGGCCCCGCCCCGCCC^CAGCCCCGCCCACCCGGCAGGCCC 1262 

M M II II I M 1 1 1 1 M M 1 1 1 M M M MM I M l! 1 1 1 II M III 1 1 1 1 II 1 1 1 1 1 

Db 16 03 GTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCC 1662 

Qy 12 63 GGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCAC 1322 

I I I I I I I I I ' I I I I I I I I I I I I I I I I ' I I I I I I I I I I I I I I I I I I I I I I 

Db 1663 GGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCAC 1722 

Qy 1323 TTGGG AT CGATTAAAG CGG C CG CG A 134 7 

II MIIMIMI II I I II 

Db 1723 TTGGGATCGATTAAAGGTGGAGAGA 174 7 



RESULT 



4 



SSTGFBR 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 
FEATURES 

source 



CDS 



mat_peptide 



misc feature 



misc feature 



SSTGFBR 1605 bp mRNA linear MAM 27-MAR-1995 

Porcine mRNA for transforming growth factor-beta (TFG) precursor. 
Y00111 

Y00111.1 GI:2129 
transforming growth factor-beta. 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
1 (bases 1 to 1605) 
Derynck,R. and Rhee,L. 

Sequence of the porcine transforming growth factor-beta precursor 

Nucleic Acids Res. 15 (7), 3187 (1987) 

87174844 

3470708 

Locat ion/Qua 1 i f iers 
1. .1605 

/organism="Sus scrofa" 
/mol_type=" mRNA" 
/db_xref = M taxon: 9823" 
/ 1 i s sue_t ype = " ova ry 11 
/clone_lib= M lambda gtlO" 
404. .1576 

/note="TFG-beta precursor (AA 1-390)" 

/codon_start=l 

/protein_id="CAA68291 . 1" 

/db_xref ="GI :2130" 

/db_xref ="SWISS-PROT: P07200" 

/ trans la tion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMLESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
1238. .1573 

/product = "mature TFG-beta" 
647. .655 

N-glycosylat ion site" 



BASE COUNT 
ORIGIN 



298 a 



/note="pot . 
809. .817 
/note="pot . 
572 c 



N-glycosylat ion site" 
437 g 298 t 



Query Match 84.8%; 
Best Local Similarity 96.1%; 
Matches 1198; Conservative 



Score 1143 .6; DB 4; 
Pred. No. 2.7e-192; 
0; Mismatches 9; 



Length 1605; 
Indels 39; 



Gaps 



1; 



Qy 



Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

M Ml II! I II I ' 1 1 1 1 1 1 1 1 1 1 1 I ! I ! , 1 1 1 1 ! I 1 1 1 1 , ! 1 1 1 ! I i 1 1 1 1 1 

3 9 9 CCCCCATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 4 58 



Qy 



Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

1 1 1 1 I 1 1 1 1 1 1 1 II I ! II 1 1 1 1 1 1 , M || 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 . 1 1 II ILi, 1 1 M 

4 59 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 518 



Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

III 1 IIIIIMII 1 II 1 1 1! II 1 II 1 1 1 MM: II: II 1 II ! 1 1 1 MM II 1 !l 1 'II 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 


185 


Db 


519 


578 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 MMMMMMMMIMMMMMMMMMMMMMMMMI MMMI 

TCGCCAGCCCGCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGG 


245 


Db 


579 


638 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGGCCGAGC 


305 


Db 


639 


MMMMMMMMMMMMMMMMMMMIMMMMMMMMMMI 

CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


698 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

MMMMMMMMMMMMMMMMMMMMMM lllllllllllllll 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGCTGGAAAGCGGCAACC 


365 


Db 


699 


758 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II M M M 1 M 1 1 1 II M 1 1 II 1 1 1 M 1 M M 1 M 1 M 1 M 1 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


759 


818 


Qy 

Db 


426 
819 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

1 M III Ml IMMI Mill ' II Ml MMMI II MMI 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 
878 


Qy 

Db 


486 
879 


GGGTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

MM II MMMMMIMMMIMM IMIIIIIMMIIIIIIIIIIIIMMI 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 
938 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM Ml MMI MM M M II IIIIIIIMIII IMMI II IIMIMI II MM Ml 

GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


939 


998 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 : 1 1 1 1 1 1 

ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


999 


1058 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

Mllllllllll Mill 1 MM M 1 1 III M M 1 1 II Ml M 1 M 1 II 1 1 MM Ml 

TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 


725 


Db 


1059 


1118 


Qy 

Db 


726 
1119 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

1 1 II Ml 1 1 1 1 M 1 1 1 M 1 1 1 M 1 1 1 1 II II 1 1 II M 1 1 1 1 II Ml 1 II 1 II 1 1 II 

TCAATTCTGGCCGCCGGGGTGAGCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


785 
1178 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

MMMMMMIMM IMMIMI Mill! MMMMMMMMMMMM 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA- 


845 


Db 


1179 


1237 


Ov 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGrrCTnGATACCA ACTAGTGCT 

1 M II 1 MM MM II II 1 1 II 

GCCCTGGATACCAACTACTGCT 


q n r 

y \j zj 


Db 


1238 


1259 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 II 1 II M II II II 1 1 1 II II MM M MM II II 1 II M M 1 II I II 1 II MM 1 II 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 


965 


Db 


1260 


1319 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 


1025 



IIIIIIIIIIIIIIMII IIMI IMIIIIIIIMI llllllllllllllllllill 

Db 132 0 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 13 7 9 

Qy 102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 13 8 0 GTCCCTACATGTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 143 9 

Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

ii II II I .1 Mill I : I IIIM III I II 

Db 14 4 0 ACAACC CGGG CG CGT CGG CGG CG C CGTG CTG CGTG CCG CAGG CG CTGGAGCCACTG C C GA 14 99 

Qy 114 6 TGGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

III II II II llllllllllllll IIIIIIIIIIIIIIM I! INI! II II 

Db 15 00 TGGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1559 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 12 51 

III ll.lllll IIIIIIMIIIIIMII lllllllllllll I 

Db 1560 GGTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCAC 1605 



RESULT 5 
AF461808 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF461808 2221 bp mRNA linear MAM 03-JAN-2002 

Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete 
cds . 

AF461808 

AF461808 .1 GI : 18042250 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 

1 (bases 1 to 2221) 

Wimmers,K., Chomdej , S . , Ponsuksil i , S . and Schellander, K. 
Polymorphism in the porcine transforming growth factor beta 1 gene 
Unpublished 

2 (bases 1 to 2221) 

Wimmers,K., Chomde j , S . , Ponsuksil i , S . and Schellander, K. 
Direct Submission 

Submitted (20 -DEC-2001 ) Institute of Animal Breeding Science, 
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany 

Location/Qual if iers 

1. .2221 

/organism="Sus scrofa" 

/mol_type= "mRNA" 

/db_xref ="taxon: 9823" 

/ chromosome= " 6 " 

1. .2221 

/gene="TGFBl" 

1. .1173 

/ gene=" TGFB1 " 

/not e= " cytok ine " 

/codon_start=l 

/product="transforming growth factor beta 1" 
/protein_id="AAL57902 .1" 
/db_xref ="GI : 18042251" 

/translat ion= "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 



RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLiyLATPLERAQHL.HSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVIALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
3'UTR 1174. .2221 

/ gene= M TGFB1 " 

BASE COUNT 504 a 616 c 649 g 452 t 

ORIGIN 

Query Match 83.2%; Score 1122.2; DB 4; Length 2221; 

Best Local Similarity 96.2%; Pred . No. 1.6e-188; 

Matches 1176; Conservative 0; Mismatches 8; Indels 39; Gaps 1; 

ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 7 0 

III I Mill Mlllllllllllllllllllllllllllllllllll lllllllll 

ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

MIMMIMM IIMIIIIIIIMI MIMMIMIMMIMIMI lllllllll 

CTG A CG C CTGG CCGG C CGG C CG C CGG A CTGTC CAC CTG CAAG A C CAT CG A CATGG AG CTG 120 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

1 1 1 1 1 1 1 1 1 1 1: 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M I M 1 : 1 1 mi 

GTG AAG CGGAAG CG CAT CGAGG C CATT CG CGG C CAGATT CTGTC CAAG CTTCGGCTCGCC 18 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 50 

MMM Mill MIMMMMMI MIMMIMIMlIMM lllllllll 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 24 0 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

I II ' ' I I I : I I I I I! I ; I 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 37 0 

II M M 1 1 M M 1 1 M I 1 1 M I M I M 1 1 1 1 II III I II II I M I II I M 1 1 1 II 1 1 : 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

1 1 1 1 1 1 1 1 1 i I M 1 1 II 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 . 1 1 ! 1 1 1 1 1 1 

TATGATAAATTCAAGGG^CCCCCC^C^GCTTATATATGCTGTTOUVCACGTCGGAGCTC 4 2 0 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 90 

I I ; 1 : 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 II M ! I It I ! 1 1 1 1 1 I II I 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 
AAGTTAAAAGTGGAGC^GC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 550 

1 1 1 1 1 1 II M III 1 1 1 1 II M 1 1 II I II I M Ml Ml I II I M I M 1 1 1 M II I III I 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

1 1 1 1 1 1 M ! 1 1 1 1 1 II 1 1 1 1 1 1 1' 1 1 1 1 1 1 1 1 1 lllllllll I ; 1 1 1 1 1 1 1 1 1 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

I I I 1 1 I . I I ^ I I I I I I I I 1 1 I I I I I I I I I . I II I I 1 1 I ! I I I I 1 1 I I I I I I I I I I I 



Qy 


n 


Db 


i 


Qy 


71 


Db 


61 


Qy 


131 


Db 


121 


Qy 


191 


Db 


181 


Qy 


251 


Db 


241 


Qy 


311 


Db 


301 


Qy 


371 


Db 


361 


Qy 


431 


Db 


421 


Qy 


491 


Db 


481 


Qy 


551 


Db 


541 


Qy 


611 



Db 



601 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 66 0 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



671 



661 



731 



GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

II Mill . 1 1 1 : 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 i 1 1 1 II 1 1 1 1 1 1 ' 1 1 1 1 1 II 

GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

Ml MIMIIIIIIIMMIIIMIIIMII IMMM II! Ill IIIIIIIMII 

721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 



791 



781 



851 



835 



911 



862 



971 



922 



ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGGTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGA 

GATACCAACGACTACAAGGATGACGACGACAAGGCGCTGGATACCAACTACTGCTTCAGC 

1 1 1 M II 1 1 1 II 1 1 1 Ml M II 1 1 III 

GGCCTGGATACCAACTACTGCTTCAGC 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 

1 1 1 1 M I II MM I M 1 1 1 1 1 1 II I II II MM I M 1 1 1 1 1 1 II 1 1 II 1 1 M 1 1 1 1 1 II 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 



730 



720 



790 



780 



850 



834 



910 



861 



970 



921 



TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 103 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 ; 1 1 1 1 1 1 1 1 i 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 

TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 981 



1031 TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAC 1090 

ill 1 1 1 1 1 i i 1 1 1 : 1 1 1 1 1 1 1 . 1 1 1 . 1 1 1 1 1 e 1 1 1 1 1 1 1 : , 1 1 1 

982 TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTGTGTACAACCAGCACAAC 1041 
1091 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1150 

1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 M I III 1 1 1 1 M 1 1 1 II 1 1 II I III 1 1 1 II II I II II I II 

1042 CCGGGCGGGTCGGCGGCGCCGTGCTGCGTGCCGCAGGGGCTGGAGCCACTGCCCATGGTG 1101 
1151 TACTAGGTGGGCGGCAAGCCCAAGGTGGAGCAGGTGTCCAACATGATCGTGCGTTCCTGC 1210 

III MM IM MIMIMM Ml IMMMMIMM MMIIIIMM 

1102 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1161 

1211 AAGTGCAGCTGAGGCCCCGCCCC 1233 

IM MM II I II 
1162 AAGTGCAGCTGAGGCCCCAAGCC 1184 



RESULT 6 

DOGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



linear MAM 30-OCT-1994 



(TGFB1) mRNA, complete 



DOGTGFB1A 136 9 bp mRNA 

Canine transforming growth factor-beta 1 
cds . 
L34956 

L34956 .1 GI : 516071 

homologue; transforming growth factor-beta 1. 
Canis familiaris (dog) 
Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; 
1 (bases 1 to 1369) 

Manning, A.M. , Auchampach , J . A . , Drong,R.F. and Slightom,J.L 



Euteleostomi ; 
Cams . 



TITLE 

JOURNAL 
COMMENT 

FEATURES 

source 



gene 



5 ' UTR 



CDS 



mat_pept ide 



3 'UTR 



BASE COUNT 
ORIGIN 



Cloning of a canine cDNA homologous to human transforming growth 
factor-beta 1 (TGFbetal) 
Unpublished (1994) 

Original source text: Cams familiaris adult jugular vein 
endothelial cDNA to mRNA. 

Locat ion/Qua li f lers 

1. .1369 

/organism^ "Canis f amil iaris " 

/mol_type= M mRNA n 

/db_xref="taxon: 9615" 

/eel l_type=" LPS -activated" 

/tissue_type="jugular vein endothelial" 

/ dev_s tage= " adul t " 

1. .1369 

/gene="TGFBl" 

1. .57 

/gene=" TGFB1 " 
58. .1230 
/gene="TGFBl" 

/ f unct ion= " ant i - inflammatory agent " 

/note= "precursor" 

/codon_start=l 

/product =" transforming growth factor-beta 1" 
/protein_id="AAA51458 . 1" 
/db_xref="GI: 516072" 

/ t rans 1 a t ion= " MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
Y YAKEVTRVLMVENTNKI YEKVKKS PHS I YMLFNTS ELREAVPEPVLLSRAELRLLRL 
KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFR 
LSAHCSCDSPODNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
1039. .1227 
/gene="TGFBl" 

/product="transforming growth factor-beta 1" 
1231. .1369 
/gene="TGFBl" 
264 a 473 c 415 g 216 t 1 others 



Query Match 74 . 3%; 

Best Local Similarity 87.4%; 
Matches 1148; Conservative 



Score 1002.6; DB 4; 
Pred. No. 2.3e-167; 
0; Mismatches 125; 



Length 1369; 
Indels 41; Gaps 



3; 



Qy 

Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III II 1 1 II lllllllllll lllllllllllllllllllllll MM I 

53 CCCCCATGCCGCC CTCCGGG CTG CGG CTG CTG CCG CTG CTG CTG C CG CTG CTG CGG CT AC 112 



Qy 

Db 

Qy 

Db 

Qy 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I I I . M II I I M I I I I I I I I I I I I i I I I I I 1 I I I I I I I I I < I I 1111111:11 I 
113 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 172 



126 



185 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

I I I I I I I I I I I I I I ) I I I I 1 I I I I I I I I I I lllllllllll lllllllllll MM 
173 AACTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 232 

186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



I llllllllllllllllllllll llllllllll llllllll Mill II 1 1 1 1 

Db 233 TCTCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGTGCCGCTGCCCGAGGCCGTGCTGG 2 92 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll llllllllllllll II Mill II I III lllllllllllll 

Db 2 93 CCCTCTACAACAGCACCCGCGACCGGGTGGCGGGGGAGAGCGCCGAGCCGGAGCCCGAGC 3 52 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I I II 1 1 1 II 1 1 1 1 II 1 1 II II II 1 1 1 1 1 1 II 1 1 1 II 1 1 M II I II II I II I MM 

Db 3 53 CCGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAACACCAACA 412 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

llllllllll III Mill I II I Ml II I III 1 1 II II llllllll II I 

Db 413 AAATCTATGAGAAAGTCAAGAAAAGTCCGCACAGCATATATATGCTCTTCAACACATCAG 4 72 

Qy 42 6 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

lllllll Mill Mill II II II llllllll Ml III III II 1 1 Ml II I II 

Db 4 73 AGCTCCGAGAAGCAGTGCCTGAGCCCGTCTTGCTCTCCCGGGCAGAGTTGCGCCTGCTGA 532 

Qy 486 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

Illllllllllllll lllllllll II II MM lllllllllll llllllll II Ml 

Db 53 3 GGCTCAAGTTAAAAGCGGAGCAGCATGTGGAGCTGTACCAGAAATATAGCAATGATTCCT 5 92 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

Mill III III MM I II MM II I III Mill III I II II II I II III Mill 

Db 593 GGCGCTACCTCAGCAACCGGCTGCTGGCGCCCAGCGACACGCCAGAATGGCTGTCCTTTG 652 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

lllllll Mill III t I I I I 1 I I I I I I II III II I Mill lllllll 
Db 653 ATGTCACTGGAGTCGTGAGGCAGTGGCTGAGCCATGGAGGGGAAGTCGAGGGCTTTCGCC 712 

Qy 666 TC^GTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

Mill Ml Ml I Mill III MIMIIIIIII III III II M II llllllllll 

Db 713 TCAGTGCCCACTGTTCCTGTGACAGCAAAGATAACACACTGCAAGTAGACATTAACGGGT 772 

Qy 726 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III III lllllll I I I I I I I I I I I I I I I I I I i lllllllllll llllllllll 
Db 773 TCAGTTCCAGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGACCCTTCCTGC 832 

Qy 78 6 TCCTCATGGGCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

Illlllllllllllll 1 1 1111.11 1 II . 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 : 1 1 Mill 

Db 833 TCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCAGCGCCG-- 8 90 

Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 9 05 

lllllllll lllllllllllll 
Db 891 GGCCCTGGACACCAACTACTGCT 913 

Qy 906 TC^GCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

MIIIMIIIIIIIIIIIIIIIMIII I ' 1 1 : 1 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 : 1 1 Mill I 

Db 914 TC^GCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATC 973 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

; i 1 1 : 1 1 1 1 1 ! i m i mil iiiiiiii mil ii ii iiiiiiiiiiiiiiii 

Db 974 TGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCT 1033 

Qy 102 6 GTCCCTAC^TCTGGAGCCTAGACACTCAGTACAGC^GGTCCTGGCTCTGTAa^CCAGC 1085 

I llllllll llllllll Mill i 1 1 1 1 1 1 1 1 1 ! 1 1 1 IMIMMIIMI 



Db 



1034 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1093 



Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I I I I I I I II I I I I I I II I I I I II I I I I I I II II I I I I I I I I I I I I I II I I I I M I I II II 
Db 10 94 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1153 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1205 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I llllllllilllll I 
Db 1154 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCT 1213 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCG-CCCACAGCCCCGCCCACCCGGCAGGCCCGG 1264 

Ml IIMIIIIIIIMI Mill MM II III III II 11:11 1 1 1 1 
Db 1214 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCAGGCCCCGCCCACCGGCAGGNCCGG 12 73 

Qy 12 65 CCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGC 1318 

I I I I 1111,11 III I || I I I II I I I I I IM I I I II Mllll II 
Db 1274 CCCCGCCCCCGCCCGCTGCGCC-GGGCTGTATTTAAGGACACCGCGCCCCAGGC 1326 



RESULT 7 

E00973 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



linear PAT 29-SEP-1997 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



E00973 2527 bp RNA 

cDNA encoding human TGF-beta. 
E00973 

E00973 . 1 GI :2169234 
JP 1986219395-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Euthena; Primates; 
1 (bases 1 to 2527) 
Riku, M . A. D. D . and Debit sudo , B . G . 
NUCLEIC ACID CODED WITH TGF-BETA AND ITS USE 
Patent: JP 1986219395-A 1 29-SEP-1986; 
GENENTECH INC 
OS human 

PN JP 1986219395-A/l 

PD 29-SEP-1986 

PF 20-MAR-1986 JP 1986064661 

PR 22-MAR-1985 US 85 715142 

PI RIKU MAIKERU ANDORE DERINKU, DEBITSUDO BANNOOMAN GETSUDERU 
C12P21/00,C12Nl/00,C12N5/00,C12N15/00//C12Ql/68, (C12P21/00, PC 
C12R1:91) , 



PC 



PC 

cc 
cc 
cc 
cc 
cc 

FH 
FT 
FT 
FT 
FT 
FT 
FT 



(C12N1/00,C12R1:19) , (C12N5/00 , C12R1 : 9 1 ) , (C12N15/00 , C12R1 : 91) 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
anti-sense: No; 

♦source: t issue_type=placenta and glyoblastoma ; FH Key 
Loca t i on/ Qua 1 i f i er s 



CDS 

mat_peptide 

5 1 UTR 
3 'UTR 



842. .2014 
/product= 'pre 
1676. .2011 
/product^ ' TGF- 
1. .841 
2015. .2537 



TGF-beta ' 



beta 1 



FT stem_loop 37. .113 

FT stem_loop 2015. .2100 

FT polyA_site 2514. .2519. 

FEATURES Loca t ion/ Qua 1 i f i ers 

source 1. .2527 

/organism= "Homo sapiens" 
/mol_type= "genomic RNA " 
/db_xref="taxon: 9606" 
BASE COUNT 472 a 888 C 735 g 432 t 

ORIGIN 

Query Match 73.5%; Score 991.6; DB 6; Length 2527; 

Best Local Similarity 85.7% ; Pred. No. 1.9e-165; 

Matches 1142; Conservative 0; Mismatches 149; Indels 41; Gaps 2; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I I I II I' II I I II I Ml llllllll III I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTG CTGA CG C CTGG C CGG C CGGCCGC CGGACTGTCCAC CTG CAAGAC CAT CGACATGG 12 5 

I Mlllll llllllll lllllllll Mill i III Ml H II llllll 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

I MM II II MM III I Mill llllll III I III Ml MM I II II MUM MM 

Db 9 57 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTG C CAG C C C C C CGAG C CAGGGGGA CGTG C CG C C CGG C C CG CTG C C TGAGG CAGTA CTGG 24 5 

I lllllllll.il Mill II III I II 1 1 1 II 1 1 1 Ml 1 1 MM II II I 

Db 1017 TCG CCAG C CC C CCGAG C CAGGGGGAGGTG CCG C C CGG C CCG CTG C C CGAGG C CGTG CT CG 1076 

Qy 246 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll IM Ml MM MM MM II MM 

Db 1077 CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 306 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I 1 1 II I IM IM III II M Ml IMMI Ml I M I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCAC CCG CGTG CTAATGGTGGAAAC CCA CAACG 1196 

Qy 366 AAATCTATGATAAATTC^GGGC^CCCCCCA^GCTTATATATGCTGTTO^CACGTCGG 42 5 

MIMIMII II llllll I I llllll llllllll I llllllll II I 
Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^ 12 56 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

Mlllll llllllll II Mill II llllllll MIMMMIMM Mlllll 

Db 12 57 AG CTCCGAGAAG CGG TA C CTGAA C CCGTGTTG CT CTC CCGGG CAGAG CTG CGT CTG CTGA 1316 

Qy 4 86 GGCTO^GTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

Ml IMII II MM I I IMIMI II MM I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 IMMI 

Db 1317 GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTGTACCAGAAATAC^GC^a^ATTCCT 1376 

Qy 546 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM I 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I M IMMI I II MM 

Db 1377 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 14 3 6 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 1 II 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 III III III II II Mill Mlllll 



Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



14 3 7 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 
666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II lllllll 1 1 II llllllll MINIMI II Mill II lllllll 

14 97 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1556 
72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml I I MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIMIIIIIIIIII lllllll 

15 57 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 
786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I i 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 M 1 1' 1 1 1 1' 1 Mill IIIIIIIIMIIMIIII 

1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 167 5 

84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

llllllll llllllll II I I 
1676 GCCCTGGACACCAACTATTGCT 16 97 

906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

M M M M M Ml MM I I'l 1 1 M i M 1 1 1 1 1 1 1 llllllllllllll lllllll 

16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 

966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

I llllllllllllll II II IMIIIIIIIIIIIIIIIII llllllll lllllll 

17 58 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

I llllllll llllllll Mill IIIIIIIIIMIIIIIIIII I ' ' I , I I I I I I 
1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I I I I I I I ! I I I I I I I I I I I I I I I I I I . I I I I I I ! I I I . I i I I I I ' 1 I I I lllllll 
1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 ! M M 1 1 1 Ml 1 1 ! 1 1 1 1 1 1 I 

1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCT 19 97 
12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

I M ! M I M 1 1 1 1 1 1 1 1 ! I MINIM I I MM Ml II III 

1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC- - CCGCCCCGCCCGGCCCGACCCCGCCCC 2 055 
12 66 CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 132 5 

II II I III I II I M 1 1 M 1 1 1 II I NIM 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 

2 056 GCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAGCCCACCTG 2115 

1326 GGATCGATTAAA 1337 

III III 
2116 GGGCCCCATTAA 2127 



RESULT 8 

A06669 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 



A06669 2537 bp 

Synthetic mRNA for preTGF-Betal . 
A06669 

A06669 . 1 GI :412940 



mRNA 



linear 



PAT 29-JUL-1993 



KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



synthetic construct 
synthetic construct 
artificial sequences . 
1 (bases 1 to 2537) 

NUCLEIC ACID ENCODING TGF- beta 3 AND ITS USE 
Patent: WO 8912101-A 4 14-DEC-1989; 

Locat ion/Qua 1 i f i ers 

1. .2537 

/organism=" synthetic construct" 

/mol_type= " mRNA " 

/db_xref = " t axon : 3 2 6 3 0 " 

842. .2014 

/ codon_s tart = 1 

/transl_table=ll 

/product = " preTGF -bet a 1 " 

/protem_id= n CAA00588 .1" 

/db_xref="Gl : 412941" 

/ trans la t ion= "MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDI NGFTTGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
473 a 893 c 739 g 432 t 



Query Match 73 . 4%; 

Best Local Similarity 85.5%; 
Matches 1146; Conservative 



Score 9 9 0.8; DB 6; 
Pred. No. 2.6e-165; 
0; Mismatches 147; 



Length 2537; 
Indels 47; 



Gaps 



2; 



Qy 
Db 



6 CCGAGATGG CGC CTT CGGGG CTG CGG CT CTTG CCG CTG CTG CTG CCG CTG CTGTGGCTG C 6 5 

II III I I I I II lllllllllll llllllllllll I I I II ' 1 I MM I 
837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 

Db 

Qy 
Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I IIIMIIIII II II llllllll Mill 1 1 1 i I i 1 1 1 1 1 1 1 MM 

8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 



126 



185 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

1 1 1 M M Ml MM I M I Ml I M I M I M lllllllllll MIMIIIIII Ml 

957 AG CTGGTGAAG CGGAAG CG CATCGAGG C CAT CCG CGG C CAGAT C CTGTCCAAG CTG CGG C 1016 



Qy 

Db 

Qy 

Db 

Qy 

Db 



186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I MINI llllllll I i II I MMMMMMMMM MM II II I 

1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



246 



305 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

I II llllllll MM llllllll Ml MIMMIIMIIII 

1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 



306 



365 



CAGAGG CGGA CTACTACG C CAAGGAGG T CAC C CG CGTGCTAATGGTGGAAAG CGG CAACC 

I MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 II 

1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 



Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


1197 


MINIUM II llllll 1 1 llllll llllllll 1 MMMM II 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


1256 


Qy 
Db 


426 
1257 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTGTCTCGGGCAGAGCTGCGCCTGCTGA 

lilllll MMMM II Mill II MMMM llllllllllllll IMIIII 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 
1316 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 llllll 
1 1 1 i II 1 II 1 II II 1 1 1 II II 1 II I 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 llllll 

GGCTC^AGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


1317 


1376 


Qy 


546 


GGCGCTAGCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM IMIIIIIIIIMIIIIMMII Mill Mill II llllll 1 II MM 

GGCGATACGTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1377 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 III III III II II 1 1 1 1 1 1 1 1 1 1 1 1 


665 


Db 


1437 


ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


1496 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II IMIIII MM MMMM MMMMMI II Mill II lilllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


725 


Db 


1497 


1556 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 llllllll llllllllllllllllllll llllllllllllll IMIIII 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 MMMMMIMMMMMIMMIMM Mill IMMMIMMMIMI 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1617 


1675 


Qy 


846 


CGCTGGATAGCAACGAGTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

MMMM MMMM MM 

GCCCTGGACACCAACTATTGCT 


905 


Db 


1676 


1697 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

II 1 1 1 M II 1 1 II 1 II 1 II 1 1 1 II M II II 1 II II M MIMMMIMM MMMI 

T(^GCTC(^CGGAGAAGAACTGCTGCGTGCGGC^GCTGTAC^TTGACTTCCGCAAGGACC 


965 


Db 


1698 


1757 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 MIMMMIMM II II MMMMMIMMIMM llllllll MMMI 

TCGGCTGGAAGTGGATCC^CGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1025 


Db 


1758 


1817 


Qy 


1026 


GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

1 llllllll llllllll Mill llllllllllllllllllll Mill llllllll 

uLLLL li-i^—M. Ill UuHuLL 1 LjoA^-x\L-LjL-M.oi 1 AL^LjL-AAoLj 1 CL. 1 bbLLL 1 <j 1 AL-AAL.L-A.LjL- 


1085 


nh 


i ft i ft 


1 O "7 "7 

15// 


Qy 


1086 


ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 

1 MMMMMI MMMI. 1 1 M M MM Ml III MM MMMI '1 MMMI 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 


1145 


Db 


1878 


1937 


Qy 


1146 


TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 

1 ! 1 J 1 1 1 ! J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 i 1 1 j i 1 1 1 1 ( 1 1 1 1 [ 1 1 1 1 1 1 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 


1205 


Db 


1938 


1997 



Qy 



12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 



1 1 : 1 1 1 I , I 1 1 1 1 1 1 1 1 1 llllllllllll II I II I I I Mill I 

Db 19 98 CCTGOU\GTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 

Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

III I I I I III II II I I < I I i I i II 1 I I I I I I I Mlllllllll 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 133 7 

Mill MM I I M 

Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 9 

OATGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



mat_peptide 



OATGFB1 1173 bp mRNA linear MAM 18-APR-1995 

0. aries mRNA for transforming growth factor-beta I. 

X76916 

X76916 . 1 GI :496648 

TGF-beta 1; transforming growth factor-beta 1. 
Ovis aries (sheep) 
Ovis aries 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Ruminant ia; Pecora; Bovoidea; 
Bovidae; Caprinae; Ovis. 
1 

Woodall , C. J. , McLaren, L.J. and Watt, N.J. 

Sequence and chromosomal localisation of the gene encoding ovine 

latent transforming growth factor-beta 1 

Gene 150 (2), 371-373 (1994) 

95121932 

7821809 

2 (bases 1 to 1173) 
Woodall, C. 
Direct Submission 
Submitted (24 -DEC- 1993 ) C. 
Veterinary Pathology, Sc. 
Edinburgh EH 9 IQH, UK 

Location/Qualif iers 

1. .1173 

/organ ism=" Ovis aries" 
/mol_type="mRNA" 
/db_xref = n taxon: 994 0" 
1. .1173 
/codon_start =1 

/product=" transforming growth factor-beta 1" 
/protem_id="CAA54242 .1" 
/db_xref="GI : 496649" 
/db_xref =" SWISS-PROT: P50414 " 

/ 1 rans 1 at ion= " MPPSGLRLLPLLLPLLWLLMLTPGRP VAGLSTCKTI DMELVKRK 
GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD 
YYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFFNTSELREAVPEPVLLSRADVRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR 
LSAHCSCDSKDNTLQVDI NGFSSGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
835. .1170 

/product="transforming growth factor-beta 1" 



Woodall, Univ. of Edinburgh, Dept. of 
of Vet. Studies, Univ. of Edinburgh, 



BASE COUNT 245 a 378 c 336 g 214 t 

ORIGIN 



Query Match 73.0%; Score 984.8; DB 4; Length 1173; 

Best Local Similarity 89.6%; Pred. No. 3.3e-164; 

Matches 1086; Conservative 0; Mismatches 87; Indels 39; Gaps 1; 



Qy 


n 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III IIIIIIIIIIIIMMIII 1 1 1 1 h 1 1 II Ml 1 1 1 1 1 1 1 In 1 1 1 1 1 III 1 1 

ATGCCGCCTTCGGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGTGGCTACTAATG 


70 


Db 


i 


60 


Qy 


71 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

lllllllllllllllllll M M M M M 1 M MMM 1 M 1 1 1 II 1 1 M 1 M 1 M III 

CTGA CG CCTGG CCGG CCGGTCG C CGG A CTGT C CAC CTG CAAG AC CAT CGA CATGGAG CTG 


130 


Db 


61 


120 


Qy 


131 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

Mill Mill 1 1 1 f 1 1 1 1 F 1 1 1 1 Mill MINI MMM MMM III 

GTGAAGCGGAAGGGCATCGAGGCCATCCGCGGTCAGATTTTOTCCAAArTTrnnrTrnrr 


190 


Db 


121 


i on 


Qy 


191 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

II lllllllllllllllllllllll II I II 1 1 1 1 : Mill MMM III 

AGTCCCCCGAGCCAGGGGGACGTGCCACCCGGCCCGCTGCCCGAGGCCATACTGGCCCTT 


250 


Db 


181 


240 


Qy 


251 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

1 Ml M MM Ml 1 Ml Ml M 1 M 1 1 M MM MMM MINIMI 

TACAACAGTACCCGCGACCGGGTGGCCGGGGAAAGTGCCGAAACGGAGCCTGAGCCAGAG 


310 


Db 


241 


300 


Qy 


311 


GCGGACTACTACGC(^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGG(^CaWVTC 

MMMMMMIMMMMMMMMMMIMIIMIMM Mill! Mill 

GCGGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAATACGG(^(^AAATC 


370 


Db 


301 


360 


Qy 


371 


TATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTOWVCACGTCGGAGCTC 

Mill III 1 Ml III IIIIII MMM, 1 IIIIIIIIIII MMM 

TATGACAAAATGAAGTCTAGCTCGCACAGCATATATATGTTCTTCAACACGTCCGAGCTC 


430 


Db 


361 


420 


Qy 


431 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

IIIIIIII Mill IIIIIIII M M 1 1 Ml Ml III 1 1 II 1 II II 1 1 1 1 1 1 1 1 II 

CGGGAAGCAGTGCCTGAACCTGTGTTGCTCTCTCGGGCAGACGTGCGCCTGCTGAGGCTC 


490 


Db 


421 


480 


Qy 


491 


AAGTT AAAAGTGG AG CAG CA CGTGGAG CTATA C CAG AAATA CAG CAATG ATT C CTGG CG C 
I l l I l i I I i I i I I I I i I I I I i i i i i i i i i iiiiiiiiiii i i i i i mi iiiiiiii 

II 1 1 II 1 II 1 II M 1 II 1 II 1 II 1 II II 1 M II II 1 II 1 1 1 1 1 II 1 II II II II 1 1 

AAGTTAAAAG TGG AG CAG CA CGTGGAG CTGTA C CAGAAATAT AG CAA CAATTC CTGG CG C 


550 


Db 


481 


540 


Qy 


551 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


610 


Db 


541 


1 1 1 1 II M 1 1 II 1 II 1 1 II 1 IMIIMI M II MM 1 1 1 1 M II M 1 1 M 1 1 Ml 

TACCTCAGCAACCGGCTGCTCGCCCCCAGCGACTCACCGGAGTGGCTGTCCTTTGACGTC 


600 


Qy 


611 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

II 1 II 1 1 1 1 M II 1 1 II 1 II II 1 II 1 1 IMIIMI Mill II IIIIIMIIMI 

ACTGGAGTTGTGCGGCAGTGGCTGACCCACAGAGAGGAAATAGAAGGCTTTCGCCTCAGT 


670 


Db 


601 


660 


Qy 


671 


GCCCACTCTTCCTCTGAC^GCAAAGATAACA(^CTC(^CGTGGAAATTAACGGGTTaVAT 


730 


Db 


661 


MMM Mill MMM II IIIIIIII M II Mill II MINIUM 1 

GCCCACTGTTCCTGTGACAGTAAGGATAAC^CGCTTCAAGTGGAC^TC^CGGGTTCAGT 


720 


Qy 


731 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


790 



II 1 1 ' 1 1 1 1 : 1 II I 1 1 MIIIIIIIIIIMIIIIMII Mill IIIIIIIIMI 



Db 



721 



TCCGGCCGCCGGGGTGACCTCGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 



Qy 


791 


Db 


781 


Qy 


851 


Db 


835 


Qy 


911 


Db 


862 


Qy 


971 


Db 


922 


Qy 


1031 


Db 


982 


Qy 


1091 


Db 


1042 


Qy 


1151 


Db 


1102 


Qy 


1211 


Db 


1162 



II 



II 



MINIMUM! Mill IIIMIIIIMIIIIMIII II 1 1 1 1 II 1 1 M M M 1 1 



M I MMM I IMM MM! M Ml IM M! 1 1 1 II II 1 1 1 1 M M I M I I 



RESULT 10 
BC000125 

LOCUS BC000125 1780 bp mRNA linear PRI 12-JUL-2001 

DEFINITION Homo sapiens, Similar to transforming growth factor, beta 1, clone 

MGC:3119 IMAGE:3351664 , mRNA, complete cds . 
ACCESSION BC000125 

VERSION BC000125.1 GI : 12652748 

KEYWORDS MGC . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1780) 
AUTHORS Strausberg , R . 
TITLE Direct Submission 

JOURNAL Submitted ( 03 -NOV-2 000 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 

Email : cgapbs -r@mail .nih.gov 



Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra. 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row.- e Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

Locat ion/Qua 1 i f iers 
1. .1780 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db__xref ="taxon: 9606" 

/clone="MGC:3119 IMAGE : 3351664 " 

/ 1 issue_type= 11 Eye , ret inoblas toma " 

/ clone_l ib= "NI H_MGC_1 6 " 

/lab_host="DH10B-R" 

/not e= "Vector : pOTB7 " 

447. .1619 

/codon__start = l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH00125 .1" 
/db_xref ="GI : 12652749" 

/ trans la tion= "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 11 
341 a 648 c 493 g 298 t 



Query Match 73.0%; Score 984.4; DB 9 

Best Local Similarity 85.2%; Pred. No. 3.7e-164 
Matches 1142; Conservative 0; Mismatches 151 



Length 178 0; 

Indels 47; Gaps 2; 



Qy 

Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

1 1 III Mil II IIIIIIIIIII III MINIMI 1 1 1 II I II 1 1 1 1 1 1 I 

44 2 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 501 



Qy 6 6 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 M M It M I ! 1 1 M 1 1 1 1 II ! M I Mill IMMMMMMI MINIUM 

Db 5 02 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 



Qy 12 6 AGCTGGTGAAGCGGAAGCGO\TCGAGGCCATTCGCGGCO\GATTCTGTCCAAGCTTCGGC 185 

IIMIMIIIIIIMIIMIIIIIIIIIIII llllllll IIMIIIIIII MM 

Db 562 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 621 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 245 

I 1 1 1 1 ! 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 IMMIIMIIIMMIMI MIM II II I 

Db 622 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 681 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 305 

I II llllllll 1 1 1 1 M 1 1 1 M 1 1 llllllll MM IIIMIIIIIIIIIII 

Db 682 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 741 

Qy 306 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

i mil 1 1 1 1 r 1 1 r 1 1 ! r 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 r i r 1 1 1 1 1 1 r 1 1 1 i mm 

Db 742 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 801 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCrArAOrTTATATZXTnrTnTTcaACACGTCGG 125 

IMIIIMM II MMM I I MMM II MMM I MM MM II I 

Db 802 AAATCTATGACAAGTTCAAGCAGAGTACACA.CAGCATATATATGTTCTTCAACACATCAG 861 

Qy 4 26 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

MMMI llllllll II Mill II llllllll MMIMMIMM lllllll 

Db 862 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 921 

Qy 4 86 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

MMMMMMMMMMMMMIMIMM IMIIIIIIIIIMIM MMM 

Db 922 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 981 

Qy 54 6 GGCGCTAGCTCAGCAACCGGCTGCTGGGCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1 1 II 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill M MMM I II MM 

Db 982 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGGGACTCGCCAGAGTGGTTATCTTTTG 1041 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTGGCC 665 

MIMIIIIIIIIIIMIMIIMII Ml III III II II Mill lllllll 

Db 1042 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1101 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I M MMMI MM IIIIMM 1 1 1 1 f 1 1 1 1 M II Mill M II M I M 

Db 1102 TTAGCGCCCACTGGTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1161 

Qy 726 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I IIIIMM MMMMIMMMIMM MMMIMIMM lllllll 

Db 1162 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGG 1221 

Qy 78 6 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

i 1 1 1 1 1 . i 1 1 1 1 I ' I I I I ' I I : I .' ' 1 Mill MMM M I I I M I I 1 1 1 

Db 1222 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCArCGrrnA - 128 0 

Qy 846 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

1 1 1 1 1 1 1 1 llllllll MM 

Db 1281 GCCCTGGACACCAACTATTGCT 13 02 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 1 1 1 1 1 M 1 1 1 1 M M 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMIMIMM MMMI 

Db 13 03 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1362 



Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I MIIIIIIIIIMI II II M I M I ! I ! 1 1 1 1 1 1 1 II 1 1 MM Ml 

Db 13 63 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 22 

Qy 102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

I MM MM I Mill 1 Ml MMMMMIM 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1423 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 14 82 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I MIMMIMI 1 1 III I M 1 1 II I M 1 1 1 1 M 1 1 1 1 1 1 II I 1 1 M 1 1 1 MUM 

Db 148 3 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGGTGCCCA 1542 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 1 II I II I M 1 1 1 1 1 1 Ml Ml II 1 1 1 1 1 M 1 1 Ml I Ml I II M I M 1 1 1 II I M. I 

Db 154 3 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1602 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 65 

MMMMMIM II llllllll II I II I I I Mill I 

Db 1603 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCGGCCCCGGCAGGCCCGGCCCCAC 1662 
Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGGTGTATTTAAGGACATCGTGCCCCAAG 1317 

Ml 1 1 1 1 Ml II II 1 1 1 1 i 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 I Mllll 

Db 1663 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCAAG 1722 

Qy 1318 CCCACTTGGGATCGATTAAA 133 7 

Mill MM I I II 
Db 1723 CCCACCTGGGGCCCCATTAA 1742 



RESULT 11 

BC001180 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC001180 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:2323 IMAGE : 3356605, mRNA, complete cds . 
BC001180 

BC001180. 1 GI : 12654682 
MGC . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhmi; Hominidae; Homo. 
1 (bases 1 to 1780) 
Strausberg, R. 
Direct Submission 

Submitted ( 1 1 -DEC-2 000 ) National Institutes of Health, Mammalian 
Gene Collection (MGC), Cancer Genomics Office, National Cancer 
institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact : MGC help desk 
Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 



Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterf ield , 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra . 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 10 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 37097. 

Location/Qualifiers 

1. .1780 

/organism="Homo sapiens" 

/mol type="mRNA" 

/db_xref = 11 1 axon: 9606" 

/clone="MGC:2323 IMAGE : 3356605 " 

/tissue_type= M Eye, retinoblastoma" 

/ c 1 one_l ib= "NI H_MGC_1 6 " 

/lab_host="DH10B-R" 

/not e= " Vector : pOTB7 " 

447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protem_id="AAH0118 0. 1" 
/db_xref =»GI : 12654683" 

/ trans la t ion= "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNE I YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDS PEWLSFDVTGVVRQWLSRGGE I EGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS " 
341 a 648 c 493 g 298 t 



Query Match 73.0%; Score 984.4; DB 9 

Best Local Similarity 85.2%; Pred . No. 3.7e-164 
Matches 1142; Conservative 0; Mismatches 151 



Length 178 0; 
Indels 47; 



Gaps 



2; 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II Ml 1 1 1 1 II MINIUM, Ml MINIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

442 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 501 

66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I lllll I I I I I I I I I M I I I I I I [ I I I ! I I 

502 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 561 

126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

I I ] 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

562 AG CTGG TGAAG CGGAAG CG CATCG AGG CCAT C CG CGG CCAG AT CCTG T CCAAG CTG CGG C 621 

186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I [ I I I I I I lllll II II I 



Db 


622 


Qy 


246 


Db 


682 


Qy 


306 


Db 


742 


Qy 


366 


Db 


802 


Qy 


426 


Db 


862 


Qy 


486 


Db 


922 


Qy 


546 


Db 


982 


Qy 


606 


Db 


1042 


Qy 


666 


Db 


1102 


Qy 


726 


Db 


1162 


Qy 


786 


Db 


1222 


Qy 


846 


Db 


1281 


Qy 


906 


UD 


1 ~i n "3 


Qy 


966 


Db 


1363 


Qy 


1026 


Db 


1423 



622 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 681 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll llllllllllllll llllllll MM IIIMMMIIIIIII 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 741 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill 1 1 1 1 1 ! 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 01 



II llllll I I MINI llllllll I llllll 



AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll llllllll M Mill II llllllll llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 921 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

IIIIIIIIIIIIIIIIIIIIIIMIIIIMIIII hi II Mlh llllll 

GG CTCAAGTTAAAAGTGGAG CAG CA CGTGG AG CTGTACCAGAAATACAGCAACAATT CCT 981 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM llllllllllllllllllllill Mill Mill II llllll I II 1 1 1 1 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1041 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

IIIIIMMIMIIIMMMIMM III Ml Ml II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1101 
TC^GTGCC(^CTCTTCCTCTGACAG(^W^GATAACACACTCCACGTGGAAATTAACGGGT 725 

I II lllllll MM MM.; IIMIIIIMI II Mill M MIIMI 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 1161 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

Ml I I llllllll 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 MIIMI 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1221 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MIIIIMIIIIIIMM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 128 0 

CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

llllllll llllllll I I I I 
GCCCTGGACACCAACTATTGCT 13 02 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 II 1 1: 1 1 1 1 ! 1 1 1 1 1 1 1 L Ml I II I II 1 1 1 1 1 1 llllllllllllll MIIMI 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1362 
TGGG CTGGAAG TGG A TT CATG AA C C CAAGGG CT A C CATGC CAATTT CTG CCTGGGG C C CT 102 5 

I 1 1 1 1 1 1 1 II 1 1 II I M II llllllllllllllllllll 1 1 1 r 1 1 1 1 MIIMI 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 22 
GTCCCTACATCTGGAGCCTAGACACTC^GTACAGC^GGTCCTGGCTCTGTAC^ACCAGC 1085 

I llllllll llllllll Mill llllllllllllllllllll I'M! 'MM 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 14 82 



Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 1145 

I lllllllllll IMIIIIIIIIMMIIIIIIIIIMIMMIIIIIII Mill I 

Db 14 83 ATAA CCCGGG CG CCTCGG CGG CG C CGTG CTG CG TGC CG CAGG CG CTGGAG CCG CTGCC CA 1542 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII I 

Db 154 3 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1602 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

II 1 1 1 1 1 1 1 1 ! II 1 1 1 1 lllllllllll II I II I I I Mill I 

Db 1603 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1662 
Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

III MM III II M 1 1 ! 1 1 1 1 : 1 1 1 1 1 1 ! 1 1 1 I MUM 

Db 1663 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCAAG 1722 

Qy 1318 CCCACTTGGGATCGATTAAA 133 7 

Mill MM I I II 
Db 1723 CCCACCTGGGGCCCCATTAA 174 2 



RESULT 12 

AGMTGFB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



AGMTGFB 1561 bp mRNA linear PRI 27 -APR- 1993 

Simian transforming growth factor-beta (TGF) mRNA, complete cds . 
M16658 

M16658 .1 GI : 176552 

growth factor; transforming growth factor-beta. 
Cercopithecus aethiops (African green monkey) 
Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Cercopi thecidae ; 
Cercopithecinae; Cercopithecus . 
1 (bases 1 to 1561) 

Sharpies, K., Plowman,G.D. , Rose,T.M., Twardzik, D . R . and 
Purchio, A. F . 

Cloning and sequence analysis of simian transforming growth 

factor-beta cDNA 

DNA 6 (3) , 239-244 (1987) 

87246074 

3474130 

Original source text: African green monkey cells (cell line 
BSC-40) , cDNA to mRNA, clone pTGF-beta-2. 
Locat ion/Qual if iers 
1. .1561 

/organi sm= " Cercopithecus aethiops " 
/mol_type=" mRNA " 
/db__xref = n taxon: 9534" 
262. .1434 

/note=" transforming growth factor-beta precursor" 
/codon_start=l 
/protein_id="AAA35369 . 1" 
/db__xref="GI : 176553" 

/ translat ion= "MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRK 
RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 



KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDT 
QYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
sig_peptide 262. .1095 

/note=" transforming growth factor-beta signal peptide" 
mat_peptide 1096. .1431 

/product =" transforming growth factor-beta" 

BASE COUNT 301 a 547 c 446 g 267 t 

ORIGIN 

Query Match 72.9%; Score 983; DB 9; Length 1561; 

Best Local Similarity 85.6%; Pred. No. 6.6e-164; 

Matches 1148; Conservative 0; Mismatches 145; Indels 48; Gaps 3; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III I I I I II llllllllll IIIIIIIIMIII I llllllllll I 
CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 316 

TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 1 , 1 1 1 1,1 1 1 1 M I ! M 1 1 1 1 1 Mill llllllllllllll llllllllll 

TGGTGCTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGG 376 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 II I II 1 : 1 1 1 1 1 1 1 Ml 1 1 MM IIIIMIIII MM llllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 4 36 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 245 

I 1 . 1 1 1 1 1 1 1 1 ! M ] 1 1 1 i I III 1 1 1 1 1 II I II M 1 1 1 1 1 II I Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 4 96 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II 1 1 1 1 1 1 1 1 1 1 1 1 1 II F 1 1 1 1 1 1 1 1 1 1 f 1 1 1 MM II IMMIIIMI I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 5 56 

CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I I < I I I : I I I I , I I I , I I I I i I i ' ' ! I I ' I : I i . I I I I I MM 
CGGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 616 

AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

llllllllll II MUM I I I MUM 1 1 1 1 1 1 1 1 I IMMIII M I 

AAATCTATGACAAGTTCAAGCAGAGCA 676 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

1 1 1 1 [ 1 1 Mill II M MMMM 1 1 1 1 1 1 1 1 1 1 M I f ! I II 1 1 f I 1 1 1 ! 1 1 1 

AGCTCCGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 7 36 
GGCTCAAGTTAAAAGTGGAG(^GC^CGTGGAGCTATAC(^GAAATACAGCAATGATTCCT 54 5 

1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 II M 1 1 1 IMMIII 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 llllll 



Qy 


6 


Db 


257 


Qy 


66 


Db 


317 


Qy 


126 


Db 


377 


Qy 


186 


Db 


437 


Qy 


246 


Db 


497 


Qy 


306 


Db 


557 


Qy 


366 


Db 


617 


Qy 


426 


Db 


677 


Qy 


486 


Db 


737 


Qy 


546 


Db 


797 


Qy 


606 



MM IIIIIIIMIIIIIIMIIII Ml MM Ml 1 1 1 1 MMI MMMI 



Db 8 57 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCC 916 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II 111,11 I I II IIIIIMMIIIIIIIII II II Mill II ,1111 I 
Db 917 TTAGCGCCCACTGCTCCTGTGAC^GCAAAGATAACACACTGa^AGTGGACATCAACGGGT 976 

Qy 726 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I Illlllll I I , I I I I I I I I I I I Mill I I II I III I I I I II lllllll 
Db 977 TCACTACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 1036 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I Ml ' MM 1 1 I 1 I I I I I I I 1 I : II HIM I I I I I I I | | I | | I | | | | | 
Db 103 7 TTCTCATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA- 1095 

Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

Illlllll lllllllllllll 
Db 10 96 GCCCTGGACACCAACTACTGCT 1117 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

II 1 1 1 1 1 1 II I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M I III 1 1 II III MUM lllllll 

Db 1118 TCAGCTCGACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACC 1177 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I ' III MMI II II IMMMMMIM MM 1 1 1 1 1 1 1 II 1 1 1 II 1 1 

Db 1178 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCT 1237 

Qy 102 6 GTCCCTAC^TCTGGAGCCTAGAO^CTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

MINIUM Illlllll Mill 1 . 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 lllllllllllll 

Db 123 8 GTCCCTACATTTGGAGCCTGGA^CG^GTACAGOUVGGTCCTGGCCCTGTACAACCAGC 12 97 

Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I MUM II h II 

Db 12 98 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 13 57 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

. MM . I I 1 I I I I 1 II MM I 

Db 13 58 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1417 

Qy 12 06 CCTGC^GTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 65 

lllllll Illlllll 1 1 1 1 1 1 ! 1 1 1 . [ i i Illlllll Mill II 

Db 1418 CCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGC 1477 

Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAA 1316 

Illlllll Mill I I I I I I I I I I I I I I I I I I I | MIMIIMI 

Db 14 78 CCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 1537 

Qy 1317 GCCCACTTGGGATCGATTAAA 1337 

MIMI MM I I II 
Db 153 8 GCCCACCTGGGGCCCCATTAA 1558 



RESULT 13 
E03028 

LOCUS E03028 1821 bp RNA linear PAT 29-SEP-1997 

DEFINITION DNA encoding human prepro TGF-betal . 

ACCESSION E03028 

VERSION E03028.1 GI: 2171250 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



JP 1991180192-A/l . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Crania ta; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1821) 

Ohashi,H., Ishii,Y., Miyata,Y., Miyazono , K. , Miyagawa,K. and 
Takaku , F . 

PRODUCTION OF HUMAN PR0-TGF-BETA1 BY GENETIC RECOMBINATION 

Patent: JP 1991180192-A 1 06-AUG-1991; 

KIRIN BREWERY CO LTD 

OS Homo sapiens (human) 

PN JP 1991180192-A/l 

PD 06-AUG-1991 

PF 07-DEC-1989 JP 1989318243 

PI OHASHI HIDEYA, ISHII YASUYUKI , MIYATA YOSHINORI , PI MIYAZONO 
KOHEI , 

PI MIYAGAWA KIYOSHI, TAKAKU FUMIMARO 

C12P2l/00,C07K13/00,C12N5/10,C12N15/18//C12Ql/68, (C12P21/00, 
C12R1 : 91) , 

(C12N15/18 # C12R1:91) ; 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
ant i- sense: No; 

♦source : tissue_type=placenta ; 
♦source: c 1 one =p VC 1 9 - TGF ; 



PC 
PC 
PC 
CC 
CC 
CC 
CC 
CC 
CC 
FH 
FH 
FT 
FT 



Key 

5 'UTR 
CDS 



FT 

sig_peptide 511 
FT mat_peptide 
FT 
FT 
FT 
FT 



mat_peptide 



3 'UTR 



Location/Qualifiers 

1. .510 
511. .1683 

/product= 'human prepro TGF-betal' FT 
. 597 

598. .1680 

/not e= f human pro TGF-betal' 
1345. .1680 

/product = 'human TGF-betal' 
1684. .1820. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Location/Qualifiers 
1. .1821 

/organi sm= " Homo sap iens " 
/mol__type= "genomic RNA" 
/db_xref ="taxon: 9606" 
326 a 679 c 508 g 308 t 



Query Match 72 . 7%; 

Best Local Similarity 85.6%; 
Matches 1133; Conservative 



Score 980.2; DB 6; Length 1821; 
Pred. No. 2e-163; 
0; Mismatches 143; Indels 47; 



Gaps 



Qy 



Db 



6 CCGAGATGG CG C CTT CGGGG CTG CGG CT CTTG C CG CTG CTG CTG CCG CTG CTGTGG CTGC 6 5 

II III 1 1 1 1 II lllllllllll lllllllllllll IIIIIIIIIIIIII I 

5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 



Qy 



Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I IIIIIIIIIIIIIIII IIIIIIIII Mill IIIIIIIIIIIIII IIIIIIIIII 
567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 626 



Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

II II MM III III 1 IMIMIMM Mill lllllllllll lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


185 


Db 


627 


686 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 MM MM IMIIMI! M lllllllllllllllillll MMI II II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


687 


746 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll IIIIIIIIIIIIM llllllll MM MMMMMIMMI 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


747 


806 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 . 1 1 1 1 1 1 ! 1 ! 1 1 1 1 1 1 1 1 1 : 1 j 1 1 1 1 1 1 j ! 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


807 


866 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 1 1 1 1 1 1 1 ! 1 II MMM 1 1 MMM llllllll 1 M 1 M 1 II II 1 

AAATCTATGAC^GTTCT^GC^GAGTACAC^CAGCATATATATGTTCTTCAACACATCAG 


42S 


Db 


867 


926 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll llllllll II Mill II llllllll llllllllllllll lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


927 


986 


Qy 

Db 


486 
987 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

IIIIIIIIMIIIIIIIIIIIIIIIIIMMIM IIIIIIIIMIIIIMI MMM 

GGCTC^GTTAAAAGTGGAGC^GCACGTGGAGCTGTACC^GAAATAC^GCAACyVATTCCT 


545 
1046 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


1047 


MM II Ml Ml 1 M lllllllll II Mill Mill M MMM 1 M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


1106 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

Mill MM IIIIMMI II Ml III Ml II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1107 


1166 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II lllllll MM llllllll 1 1 1 1 ! [ 1 1 1 1 1 II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGC^GGGATAACAC^CTGCAAGTGGACATCAACGGGT 


725 


Db 


1167 


1226 


Qy 
Db 


726 
1227 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 llllllll IMMMIIMMIMIII II III II M 1 1 1 1 1 lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 
1286 


Qy 


786 


TC CT CATGG C CA C C CCG CTGGAGAGGG C C CAG CA CCTG CA CAG CTC C CGG CAC CG CCG AG 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 
1 II 1 II 1 II M II II 1 II II 1 1 1 1 II II II II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1287 


1345 


Qy 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

Ml Ml llllllll MM 


905 


Db 


1346 


1367 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

MMMMIMMIMMMMMMMMMIMM llllllllllllll lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


965 


Db 


1368 


1427 



Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAA.GGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

I 11,1 II II 1 1 1 1 1 1 !l 1 1 ; I I I II III I Mill I Mill 

Db 1428 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 8 7 

Qy 102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

I MMMM MMMM INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M MMMIMMM 

Db 14 88 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 154 7 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I MMMMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 M 1 1 1 1 II 

Db 154 8 ATAA C C CGGG CG CCT CGG CGG CG C CGTG CTG CGTG CCG CAGG CG CTGGAG CCG CTG CC CA 1607 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 J I !! I i 1 1 1 1 1 1 1 [ I I | 

Db 16 08 T CG TGTA CT A CG TGGG C CG CAAG CC CAAGG TGG AG CAG CTGTC CAA CATG AT CGTG CG CT 1667 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

I II I 1 1 1 1 1 1 1 Mill 1 I 1 1 1 1 1 1 1 1 II I II I I I Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 172 7 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGA CAT CGTG CCCCAAG 1317 

III I II I III II II II I M M I II MM II II I MMMM I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 178 7 

Qy 1318 CCC 1320 

Db 1788 GCC 1790 



RESULT 14 

BC022242 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC022242 1746 bp mRNA linear PRI 04-FEB-2002 

Homo sapiens, clone MGC: 22008 IMAGE : 4399762 , mRNA, complete cds . 
BC022242 

BC022242 .1 GI : 18490115 
MGC . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1746) 

Strausberg,R. 

Direct Submission 

Submitted ( 01 -FEB-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact : MGC help desk 

Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 



Dickson, M. # Schmutz, J., Grimwood, J . , Rodriquez, A., and Myers, 
R. M. 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 27 Row: e Column: 21 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

Locat ion/Qual i f iers 
1. .1746 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref ="taxon: 9606" 

/clone= "MGC: 22 0 08 IMAGE : 43 99762 " 

/ 1 is sue_type= " Duodenum , adenocarc inoma 11 

/ c 1 one_l ib= " N I H_MGC_8 8 " 

/lab_host="DH10B" 

/note="Vector: pCMV-SP0RT6" 

370. .1542 

/codon_start=l 

/product = "Unknown (protein for MGC:22008) M 
/protem_id="AAH22242 . 1" 
/db_xref="GI : 18490116" 

/ 1 rans 1 a t i on = " MPPSGLRLLLLLLPLLWLLVLTPGR PAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLSRGGE I EGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYIiANFCLGPCPYIWSLDT 
QYSKVTALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
376 a 612 c 472 g 286 t 



Query Match 72 . 6%; 

Best Local Similarity 85.5%; 
Matches 1146; Conservative 



Score 979.8; DB 9; 
Pred. No. 2,4e-163; 
0; Mismatches 147; 



Length 1746; 
Indels 48; 



Gaps 



3; 



Qy 
Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III I I I I II lllllllllll III lllllllll llllllllllllll I 
3 65 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 4 24 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I llllllllllllllllllllllllll Mill llllllllllllll llllllllll 
425 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 4 84 



126 



185 



AG CTGGTGAAG CGG AAG CG CATCGAGG CCATT CG CGG C CAGATTCTGT C CAAG CTT CGG C 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIII lllllllllll MM!!!!!!! MM 

4 85 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 544 



186 



245 



TTG C CAG C CCC CCGAG C CAGGGGG ACGTG C CG C CCGG C C CGCTG CCTGAGG CAGTA CTGG 

I lllllllllllllllllllllll II 1 1 1 II 1 1 1 1 1 M 1 1 1 II I Mill II II I 

54 5 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 6 04 



246 



305 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

I II IMIIMI llllllllllllll IIMIIII MM MIMIMMIIIMI 

6 05 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 664 



Qy 


306 


C^GAGGCGGACTACTACGCC^AGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

i mil hi iiiiiiii minimum: Minn in i mm 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


665 


724 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

'Mill II MINI 1 1 MINI III IM 1 llllllll II 1 

AAATCTATGACAAGTTC^GCAGAGTAC^C^C^GCATATATATGTTCTTCAACACATCAG 


425 


Db 


725 


784 


Qy 
Db 


426 
785 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll llllllll II Mill 1 1 llllllll MUM Mill lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 
844 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

1 1 j 1 1 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 IIIIIMIIIIIIIMI MINI 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


845 


904 


Qy 
Db 


546 
905 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM IIIIIIIIIIMIMMIIMM Mill MIM M IMM 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 
964 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

IIIIIIIIIIMIIIIIIIIMIIII III III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


965 


1024 


Qy 


666 


TCAGTGCCC^CTCTTCCTCTGACAGC^AAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II lllllll MM llllllll Illllllilll II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


725 


Db 


1025 


1084 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III 1 1 llllllll 1 1 1 M II 1 M 1 1 1 1! 1 1 M IIIIIIIIMIIII lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1085 


1144 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 II 1 II Ml III 1 M 1 II 1 II 1 II 1 1 1 1 M Mill IIMMMIMMIMM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1145 


1203 


Qy 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 


905 


Db 


1204 


llllllll llllllll MM 
GCCCTGGACACCAACTATTGCT 


1225 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

MMIIIIMMIMMMMMMIIIMIIIMM IIIIIIIIMIIII lllllll 

TC^GCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 


965 


Db 


1226 


1285 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 I I I I 1 llllllll lllllll 
1 1 II II II 1 II 1 1 1 1 M II 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 II 1 llllllll lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1025 


Db 


1286 


1345 


Qy 


1026 


GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

1 llllllll ' .Mill 1 Mill Ml Ml MM Ml MIMMMIMI 

GCCCCTACATTTGGAGCCTGGACACGC^GTACAGCT^GGTCCTGGCCCTGTAC^ACCAGC 


1085 


Db 


1346 


1405 


Qy 


1086 


ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 

1 N J IN 1 1 II M 1 1 1 M 1 1 II M 1 1 II II II 1 M II II 1 II 1 1 1 lllllll 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 


1145 


Db 


1406 


1465 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 



114 6 TCGTGTACTACGTGGGCCGCAAGCCC^GGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 1 1 1 i 1 1 ! 1 1 ! I ! 1 1 1 i 1 1 ! i II 1 1 1 i I 1 1 1 1 ! I i 1 1 1 1 1 1 1 1 ; 1 1 1 1 1 : ; I 

14 66 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 152 5 
12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

III 1 1 1 M I M II I II M 1111:11.1111 II I M I I I HIM I 

1526 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 158 5 
12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 1316 

ill 1 1 1 1 iii ii ii i : I I I M I I I I I I I M I I II Mill 

15 86 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 164 5 

1317 GCCCACTTGGGATCGATTAAA 133 7 

MUM MM I I II 
164 6 GCCCACCTGGGGCCCCATTAA 1666 



RESULT 15 

106216 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 

BASE COUNT 
ORIGIN 



106216 
Sequence 
106216 
106216. 1 



2 from Patent 



GI :590649 



1560 bp 
EP 0293785. 



DNA 



linear PAT 02-DEC-1994 



Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 1560) 

Purchio, A. F. , Gentry, L. and Twardzik,D. 

Cloning and expression of simian transforming growth factor-SSI 
Patent: EP 0293785-A2 2 07-DEC-1988; 

Location/Qual if iers 

1. .1560 

/ organi sm= " unknown 11 
301 a 547 c 442 g 267 t 3 others 



Query Match 72.5%; 
Best Local Similarity 85.6%; 
Matches 1143; Conservative 



Score 978.6; DB 6; 
Pred. No. 4e-163; 
0; Mismatches 145; 



Length 1560; 
Indels 48; 



Gaps 



3; 



Qy 
Db 



1 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 7 0 

III MM II llllll III, II ,111 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 

2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 



Qy 

Db 

Qy 

Db 

Qy 
Db 



71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

MINIMI Mill Mill! I Mill 1 1 1 1 II 1 1 1 1 1 1 1 1 III llllll 

321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 

131 GTG AAG CGGAAG CG CAT CGAGG C CATTCG CGG CCAGATTCTGT C CAAGCTTCGGCTTG CC 190 

Ml I II Ml M I II 1 1 1 M N MM Mil IMIMIIIII 1 1 1 f I III 

381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 

191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 2 50 

I II I M Ml II Ml 1 1 1 1 M II I I M M I 1 1 I 1 1 M Mill II II II II 

4 41 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 0 0 



Qy 2 51 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

Illllll MINIMI Illlllll 1 1 1 1 II Ml, II Ml 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 

Qy 311 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 370 

1 1 1 1 1 1 M I M I M MM 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I MM lllll 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 

Qy 371 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

MMI 1 1 MMI I I I MINI Illlllll I Illlllll II MMI 

Db 621 TATGACAAGTTCAAGCAGAGCACACAC^ 68 0 

Qy 431 CGGGAAG CGGTG C CGGAAC CTGTATTGCT CTCTCGGG CAGAG CTG CGC CTG CTGAGGCT C 4 90 

II lllll II II Illlllll Illlllll II I MMI ' III 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGGTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 

Qy 4 91 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 550 

M M M M M I I l| I I ! M M I M M I M M I M I M 1 1 I I I I I II II I I I I I I 

Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 8 00 

Qy 551 TA CCT CAG CAAC CGG CTGCTGG C CCC CAG TGA CTCA CCGGAGTGG CTGT C CTTTGATGT C 610 

Illllllllllllllllllllll lllll I I I I lllllllll I I I I lllllllll 
Db 8 01 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 860 

Qy 611 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

lllllllllllllllllllll III I I I I III II II lllll Illlllll II 
Db 861 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 

Qy 671 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 73 0 

Illllll MM lllllllllllllllllllll II Mill II Illl lll I 
Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 98 0 

Qy 731 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 90 

I Illlllll I Ml II I I I I I I I I Mill MM lllllllll Illlllll III 
Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 

Qy 7 91 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 5 0 

I II M II 1 1 1 1 II Ml Ml M 1 1 II I II lllll 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 

Qy 851 GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 910 

Illlllll llllllllllllllllll 
Db 10 95 GCCCTGGACACCAACTACTGCTTCAGC 1121 

Qy 911 TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 97 0 

1 1 1 1 : 1 1 1 i ! ! 1 1 1 i 1 1 MM iM II I II 'M Illlllll Ml 

Db 1122 TCCACGGAGAAGAACTGCTNCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 1181 

Qy 971 TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 103 0 

lllllllll II II I II 1 1 III 1 1 1 M 1 1 1 II I, Ml 1 1 III M 1 1 1 II 1 1 1 III 

Db 1182 TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 1241 

Qy 1031 TACA/TCTGGAGCCTAGACACTCAGTACA^ 1090 

lllll Illlllll MMI II lllll II III I III MM I II llllll I MM III 

Db 1242 TACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 13 01 



Qy 



10 91 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 115 0 



MINIM M M MM IIMI MMM I II MMM I Ml II! MMMM Ml 

Db 13 02 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1361 

Qy 1151 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1210 

I M . 1 1 ; 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 M : 1 1 : 1 1 1 1 1 1 1 1 ! I i I MM 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 

Qy 1211 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 1270 

I I MM, II IMM MMM M I M I I I Mill 

Db 14 22 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGCCCCAC 1481 
Qy 1271 CCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAAGCCCA 1321 

Ml Ml II I Mill! II 1 1 1 1 1 1 1 1 1 IMMMIMMMI 

Db 14 82 CCCACCCCCGGTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCA 1541 

Qy 13 22 CTTGGGATCGATTAAA 1337 

I MM I I II 

Db 154 2 CCTGGGGCCCCATTAA 15 57 

Search completed: October 27, 2003, 18:35:01 
Job time : 5270.34 sees 

GenCore version 5.1.6 
Copyright (c) 19 93 - 2 003 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 405.282 Seconds 

(without al ignment s ) 
8985.201 Million cell updates/sec 

Title: US-10-017-372E-32 
Perfect score: 1349 

Sequence: 1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 134 9 

Scoring table: IDENTITYJSfUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2552756 seqs, 1349719017 residues 

Total number of hits satisfying chosen parameters: 5105512 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : N_Geneseq__l 9Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1980 . DAT: * 
2 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1981 .DAT: * 
3 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1982 . DAT : * 
4 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1983 .DAT: * 
5 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1984 .DAT: * 



6 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1985 .DAT: * 

7 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1986 . DAT: * 

8 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1987 . DAT: * 

9 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1988 .DAT: * 
10 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 9 . DAT: * 
11 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1990 . DAT: * 
12 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1991 .DAT: * 
13 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1992 .DAT: * 
14 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1993 .DAT: * 
15 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1994 .DAT: * 
16 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1995 . DAT : * 
17 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1996 . DAT : * 
18 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1997 . DAT: * 
19 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1998 .DAT: * 
2 0 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1999 . DAT: * 
21 : / SIDSl/gcgdata/geneseq/ geneseqn-embl/NA2 00 0 . DAT : * 
22 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2001A. DAT: * 
23 : / SIDSl/gcgdata/geneseq/ geneseqn-embl /NA2 001B . DAT : * 
24 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2002 .DAT: * 
25 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA2003 .DAT: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



% 

Query 

Score Match Length DB ID 



Description 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



1223 . 8 
991.6 
990. 8 
990. 8 
990.8 
990. 8 
989.2 



981 .4 
980. 6 
980.2 
979.6 
979. 6 
977 . 8 
977 . 8 
969. 8 
968.8 
966.8 
961. 8 
958 . 6 
952.6 
949 . 4 
930.2 
928 . 6 
928 . 6 
928 . 6 



986 
983 



90.7 
73 .5 
73 .4 
73 .4 
73 .4 
73 .4 
73 .3 
73 . 1 
72 . 9 
72 .8 
72 .7 
72 . 7 
72 .6 
72 .6 
72 . 5 
72 . 5 
71 . 9 
71.8 
71 . 7 
71.3 
71 . 1 
70.6 
70.4 
69. 0 
68 . 8 
68 .8 
68.8 



1326 
2527 
2537 
2537 
2537 
2537 
2537 
2537 
1561 
2742 
1559 
1821 
1560 
1560 
2745 
2745 
1303 
4105 
1571 
1569 
1569 
1561 
1561 
1176 
1176 
1176 
1176 



24 
25 
7 

11 
11 
17 
15 
19 
11 
22 
13 
12 
9 

11 
16 
22 
11 
15 
11 
9 

11 
11 
13 
25 
24 
24 
24 



AAD22696 
ABQ76674 

AAN60972 
AAQ03301 
AAQ02814 
AAT15720 
AAQ56923 
AAV52933 
AAQ03268 
AAI58342 
AAQ2028 9 
AAQ13392 

AAN81084 
AAQ03508 
AAT05876 
AAH28216 
AAQ09317 
AAQ55624 
AAQ03269 

AAN81085 
AAQ03509 
AAQ04908 
AAQ29177 
ABV753 91 
ABZ35738 
ABX09981 
ABV78162 



cDNA encoding huma 
Sequence of pre-TG 
Pre-transforming g 
Human pre-TGF-beta 
Human pre-transfor 
Simian transformin 
Human polynucleoti 
Sequence encoding 
Human pro -TGF -beta 

Coding sequence of 
Simian Transformin 
cDNA encoding tran 
Nucleotide sequenc 
Monkey transformin 
TGFbetal 5'-UTR-CD 
Human transforming 

Coding sequence of 
Human Transforming 
Sequence encoding 
TGF-beta 1/beta 2 
TGFB1 Arg25Pro pol 
Human TGF beta 1 p 
Human TGFbetal DNA 
Human TGF beta 1 D 



Porcine transformi 
Androgen receptor 
Sequence encoding 



? ft 
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6ft 
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1176 
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ART.91 70^ 


HnmAn Dnlvnurleoti 


? 9 


928 


.6 


D O . 


Q 
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1 1 76 

J. J. / o 




r\ ld v / j o ^/ £i 


THFR1 Ara25Pm nol 


30 


908 


67 


3 


1565 


13 


AA02 91 7 8 


TGF-beta 1 Homo 


3 1 


805 


59 


, 7 


2765 


22 


AAI 60128 


Human polynucleoti 


32 


788 


.2 


58 


, 4 


2208 


13 


AAO2 02 91 


^pnnpnrp encndincr 


33 


786 


.6 


58 . 


. 3 


2206 


1 1 


AAO03510 


Human Transforming 


34 


786 


.6 


58 


. 3 


2207 


1 1 


AAO03511 


Hvbrid transfnrmin 

11 V A—/ X. -L SmL w X \-A 1 1 %mj mm, Am i \ 1 Am A * 


35 


785 


58 


. 2 


2207 


1 1 


AAOO^ 1 2 7 

July \j> *j x Ca < 


Human TGF-Befa 1 /TO 


36 


773 


.6 


57 


. 3 


2217 


10 


AAN90768 


^pmipnrp of hninan 


37 


767 


. 4 


56 . 


. 9 


2200 


16 


AAT04115 


Simian -human hybri 


38 


696 


.8 


51. 


. 7 


2773 


23 


AAS84421 


DNA encoding novel 


39 


678. 


.8 


50. 


.3 


834 


12 


AAQ12192 


Sequence encoding 


40 


654 


48 . 


.5 


1376 


24 


ABK90341 


DNA encoding LAP-m 


41 


652 . 


.4 


48 . 


.4 


1389 


24 


ABK90344 


DNA encoding LAP-h 


42 


588 . 


.2 


43 . 


. 6 


1352 


24 


ABK90342 


DNA encoding mIFNB 


43 


586. 


.6 


43 . 


, 5 


1350 


24 


ABK90343 


DNA encoding huIFN 


44 


534. 


.8 


39. 


.6 


875 


23 


AAS70979 


DNA encoding novel 


45 


366. 


. 4 


27. 


.2 


489 


24 


ABL99528 


Target canine gene 



ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 standard/ cDNA; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) cDNA. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 
KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 
KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 
KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; ss. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 
FT CDS 16. . 1188 

FT /*tag= a 

FT /product^ "Porcine TGF-betal mutant protein" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001 . 
XX 

PF 20-APR-2001; 2 0 0 1W0-US12 98 0 . 
XX 

PR 20-APR-2000; 2 00 0US - 1 99 014 P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ ; 
XX 



DR WPI; 2002-026155/03. 

DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered usinq a deliverv system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is a cDNA encoding porcine TGF-betal mutant. 

xx 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 90.7%; Score 1223.8; DB 24; Length 1326; 

Best Local Similarity 96.4%; Pred. No. l.le-240; 

Matches 1299; Conservative 0; Mismatches 7; Indels 41; Gaps 3; 
Qy 1 TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 6 0 

h 1 1 1 1 MM 1 1 1 II 1 1 II III I III II 1 1 1 II II ! I II II I II II 1 1 1 1 III II , I 

Db 6 TGGTACCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTG 65 

Qy 61 GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 0 

MMMMIIMIMIIMM MIMIII IMIIMIM llllll MIMMMM 

Db 66 GCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGA 12 5 

Qy 121 CATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCT 18 0 

MIM III MMMM Mill Ml III II M Ml III IMMIIMM M I III MM 

Db 126 CATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCT 18 5 

Qy 181 TCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGT 24 0 

MMM MMMMMMMMMMMMMMMMMMMMMMMMM 1 1 

Db 186 TCGGCTCGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGT 24 5 

Qy 241 ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 3 00 

II Ml III IM Mill Mill Ml Ml II M Ml MMMM MMM III Ml MM 

Db 24 6 ACTGGCTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCC 305 

Qy 3 01 CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 3 60 

MMMM Ml MMMM M I M M M I M M M I M M M M MMMM MMMM 

Db 3 06 CGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGG 3 65 



Qy 361 CAACCAAATCTATGATAAATTO^GGGC^CCCCC(^(^GCTTATATATGCTGTT(^CAC 42 0 

1 1 1 1 II 1 1 1 1 1 II I 1 1 ! 1 1 1 M II 1 1 1 1 1 1 1 1 1 i I M I ! II 1 1 1 1 1 1 1 M 1 1 1 1 1 II 

Db 366 CAACCAAATCTATGATAAATTCAAGGGCACCC^ 42 5 

Qy 421 GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 48 0 

mi i ii 1 1 1 1 1 ii i 1 1 1 1 1 1 ii 1 1 1 ii 1 1 1 ii 1 1 1 ii 1 1 ii 1 1 1 1 1 1 1 : i ii I 1 1 in 

Db 42 6 GTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCT 48 5 

Qy 481 GCTGAGGCTCAAGTTAAAAGTGGAGGAGCACGTGGAGCTATACCAGAAATACAGCAATGA 54 0 

II I I I I I I I I II I I I I I I I I II I I I I I I, I I I I I I II I I II I I I I I I I I II I M I II 
Db 4 86 GCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGA 54 5 

Qy 541 TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 600 

II 1 1 1 1 1 1 1 ill 1 1 1 M 1 1 1 M 1 1 1 1 1 1; 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 ii I :i I in 

Db 54 6 TTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTC 60 5 

Qv 601 CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 66 0 

' Ii I Ml 1 1 1 1 II 1 1 1 II I ! I II 1 1 II 1 1 : 1 1 1 1 ! I II 1 1 1 1 1 1 1 1 1 II 1 1 II I III II; 

Db 606 CTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTT 665 

Qy 661 TCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 72 0 

IMIMIMIIIIIIIIIIIIIIMIIIMIIMIIIIIIIIIIIIMIIIIIIMIIII 

Db 666 TCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAA 72 5 

Qy 721 CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 78 0 

: II I Ml 1 1 1 1 ii 1 1 1 II 1 1 1 II I Ml 1 1 1 1 1 1 1 1 ll 1 1 1 II I ; 1 1 i 1 1 1 II I ii I in 

Db 726 CGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTT 785 

Qy 781 CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

Db 786 CCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCG 84 5 

Qy 841 CCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTA 90 0 

MM MMMMMMMMI 

Db 84 6 CCGA GCCCTGGATACCAACTA 86 6 

Qy 901 CTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAA 960 

MMMMMMMIMMMMMMMMMMMMMMMMMMMMMMI 

Db 8 67 CTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAA 92 6 

Qy 961 GGACCTGGGCTGGAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGG 102 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

Db 927 GGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGG 98 6 

Qy 1021 GCCCTGTCCCTACATCTGGAGCCTAGAC^CTC^GTAC^GC^AGGTCCTGGCTCTGTACAA 108 0 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

Db 987 GCCCTGTCCCTAC^TCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAA 104 6 

Qy 1081 CCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACT 114 0 

MMMMMMMMMMMMMMMMMIMMMMMMMMMMM IM 

Db 104 7 CCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACT 1106 

Qy 1141 GCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGT 1200 

IMMMMMMIMMMMMMMMMMMMMIMMMMMMMMMI 

Db 1107 GCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGT 1166 



Qy 

LJU 


1201 

1 1 £7 
1 io / 


GCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGC 

Illlllllllllllllllllll :l 1 1 1 1 II 1 M 1 M 1 1 1 III 1 III 1 1 1 1 1 1 1 1 1 1 1 II 


1260 
1225 


Qy 


1261 


CCGGCCCC^CCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCC 

II II Ml 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 II 1 1 ' 1 M 1 1 1 1 II 1 1 1 III 1 III 1 1 1 1 1 1 ! 1 1 1 1 1! 

CCGGCCCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCC 


1320 


Db 


1226 


1285 


Qy 


1321 


ACTTGGGATCGATTAAAGCGGCCGCGA 134 7 

II llllll llllll; 1 I II 
AC - TGGGATCGATTAAAGGTGGAGAGA 1311 




Db 


1286 





RESULT 2 
ABQ76674 

ID ABQ76674 standard; DNA; 2527 BP. 
XX 

AC ABQ76674; 
XX 

DT 26-MAR-2003 (first entry) 
XX 

DE Androgen receptor signalling pathway-associated DNA E00973 . 
XX 

KW Androgen receptor; transact lvat ion ; modulator; Smad3 ; Smad4 ; Akt ; TGF-B; 
KW signal transduction pathway; transforming growth factor-B; phosphatase; 
KW tensin; cytostatic; antiproliferative; cellular proliferation; cancer; 
KW E00973; ds . 
XX 

OS Synthetic. 
XX 

PN WO200282081-A2 . 
XX 

PD 17-OCT-2002. 
XX 

PF 05-APR-2002; 2 002WO-US11086 . 
XX 

PR 06-APR-2001; 2001US-282266P . 
PR 13-MAR-2002; 2 002US - 3 65060P . 
XX 

PA (UYRP ) UNIV ROCHESTER. 
XX 

PI Chang C; 
XX 

DR WPI; 2003-046871/04. 
XX 

PT Modulating androgen receptor activity, by administering a compound that 

PT modulates receptor activity, inhibits receptor-signal transduction 

PT pathway/receptor-coactivator interaction or changes amount or receptor 

PT 

XX 

PS Disclosure; Page 225-226; 302pp; English. 
XX 

CC This invention describes a novel method for modulating androgen receptor 

CC activity or androgen receptor-mediated transactivation activity in a 

CC cell. The method involves administering a compound which causes 

CC modulation of the androgen receptors activity and the inhibition of 

CC interaction between the receptor and a protein involved in a signal 



CC transduction pathway. The compound also inhibits the interaction between 

CC the androgen receptor and a protein selected from Smad3 , Smad4 , Akt , 

CC transforming growth factor (TGF) -B and phosphatase and tensin homologues 

CC deleted on chromosome 10 ( PTEN) or their fragments. The compounds of the 

CC invention have cytostatic and antiproliferative activity. The obtained 

CC composition is useful for treating any disease, where uncontrolled 

CC proliferation or cellular proliferation occurs such as cancer, e.g. 

CC prostate cancer. This sequence represents the androgen receptor 

CC transactivation signalling pathway modulator E00973 described in 

CC the method of the invention. 

XX 

SQ Sequence 2527 BP; 472 A; 888 C; 735 G; 432 T; 0 other; 

Query Match 73.5%; Score 991.6; DB 25; Length 2527; 

Best Local Similarity 85.7%; Pred . No. 3.1e-193; 

Matches 1142; Conservative 0; Mismatches 149; Indels 41; Gaps 2; 

C CGAG ATGG CG C CTT CGGGG CTG CGG CTCTTG C CG CTG CTG CTG CCG CTG CTG TGG CTG C 6 5 

ii iii 1 1 1 1 ii Minimi .minimum i !; 1 1 ii 1 1 1 1 1 1 i 



I 1 1 1 1 1 1 1 1 1 1 1 ! i ; MINIMI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 



AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mm 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill II II I 



CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MINIM MMIMNINN llllllll MM 1 1 II 1 1 1 1 1 1 II 1 1 1 1 

CCCTCTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 
(^GAGGCGGACTACTACGC(^GGAGGT(^CCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I II I II I I I " I I M I Nil 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCAC^GCT^ 4 2 5 

INN II MINI I I MINI llllllll I MINIM II I 

AAATCTATG A CAAGTT CAAG CAG AGTA CA CA CAG CATATATATGTT CTT CAA CA CAT CAG 1256 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

NIMH llllllll N INN II 1 1 1 1 f 1 1 1 MMIMNINN MINI! 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTC^GTTAAAAGTGGAGC^GCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

M I II MM 1 1 1 1 1 M 1 1 1 II I II I M 1 1 1 M 1 1 I M II MM I M M 1 1 1 MINI 

GGCTCAAGTTAAAAGTGGAGC^GC^CGTGGAGCTGTACC^GAAATACAGCAACAATTCCT 13 76 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

INI 1 1 1 1 II 1 1 Ml 1 1 1 M I II I II I INN Mill M WWW I N MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 14 3 6 



Qy 


6 


Db 


837 


Qy 


66 


Db 


897 


Qy 


126 


Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 



Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 


Qy 


906 


Db 


1698 


Qy 


966 


Db 


1758 


Qy 


1026 


Db 


1818 


Qy 


1086 


Db 


1878 


Qy 


1146 


Db 


1938 


Qy 


1206 


uu 


1 QQQ 

l j y o 


Qy 


1266 


Db 


2056 


Qy 


1326 


Db 


2116 



Mllllllllllllllllllllllll Ml III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 
TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

I II lllllll MM MMMM lllllllllll II Mill II lllllll 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

III I I Ml II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I ! 1 1 lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

I I ; M 1 1 1 ! 1 1 1 M M 1 1 i 1 1 1 i 1 1 Mill. Mill 1 1 II II II 1 1 1 1 1 1 1 1 1 I 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

MMMM MMMM Mil 



III ! 1 1 1 1 1 1 I I 1 1 1 1 ! I 1 1 1 1 1 : I 1 : 1 1 i 1 1 M I I MMMM lllllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

I MMMM II II 1 1 II 1 1 1 1 1 1 1 1 1 : 1 1 1 1 MMMM lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 
GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

I llilllll llilllll MM IMMMIMMMIMM I Ml M l 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 
ACAA CCCGGG CG CGT CGG CGG CG CCGTG CTG CGTGC CG CAGG CG CTGG AG CCACTG C C CA 

I lllllllllll 1 1 Ml 1 1 M M 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 MM M 1 1 lllllll 

ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 
TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 

i 1 1 1 1 1 ! 1 1 M . I M I 1 1 I ! 1 1 1 1 1 . 1 1 1 1 I , . 1 1 1 1 1 1 1 1 1 1 1 , 1 1 1 ' I 

TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCT 
CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 

I I I 1 1 1 II I 1 1 I I M 1 1 M MMMM I lllllllll II II I 

CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC - - CCGCCCCGCCCGGCCCCACCCCGCCCC 
CCCACCCCCGCCCGCCTCACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTG 

II II I Ml I II 1 1 M 1 1 1 1 M 1 1 1 1 1 llllllllllllllll II 

GCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAGCCCACCTG 

132 6 GGATCGATTAAA 13 3 7 

II I III 
2116 GGGCCCCATTAA 2127 



RESULT 3 
AAN60972 

ID AAN60972 standard; cDNA; 2537 BP. 
XX 



AC 


AAN60972; 




v V 
AA 






DT 


31-OCT-2002 


(updated) 


U 1 


28-OCT-1991 


(first entry) 


XX 






HP 

Ucj 


Sequence encoding preTGF-beta. 


V V 

AA 








Transforming growth factor beta; cancer; wound healing. 


Y Y 
AA 






Ub 


Unidentified. 


Y Y 
AA 






FH 


Key 


Location/Qual if iers 


FT 


misc structure 37.. 113 


FT 




/*tag= a 


FT 




/note= "Sequence can form stable hairpin loops" 


FT 


CDS 


842 . .2014 


FT 




/*tag= b 


r I 


mat peptide 


1676 . .2011 


prp 




/*tag= c 


XX 






FN 


EP200341-A. 




Y Y 

AA 






nn 
FU 


10-DEC-1986 . 




Y Y 
AA 






Fr 


21-MAR-1986; 


86EP-0302112 . 


V V 

AA 






PR 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


V V 

AA 






PA 


(GETH ) GENENTECH INC. 


XX 






D T 
Fl 


Derynck RMA; 




V V 

AA 






UK 


WPI ; 1986-326875/50 . 


UK 


P-PSDB; AAP61468. 


Y Y 
AA 






F 1 


TGF-beta prodn. from transformed hosts - useful esp. for treating 


PT 


wounds (J6 2/9/86) . 


AA 






PS 


Disclosure; 


Fig lb; 26pp; English. 


Y Y 
AA 






cc 


The gene product is known to stimulate cell proliferation and 


cc 


inhibit anchorage-dependent growth of a variety of human cancer eel 


cc 


lines, it is 


esp. useful in treatment of burns and the promotion of 


cc 


surface and 


internal wound healing. TGF-beta may be expressed from 


cc 


transformed 


CHO cell line. 


cc 


(Updated on 


31-OCT-2002 to add missing OS field.) 


XX 






SQ 


Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 73.4%; Score 9 9 0.8; DB 7; Length 2537; 

Best Local Similarity 85.5%; Pred . No. 4.5e-193; 

Matches 1146; Conservative 0; Mismatches 147; Indels 47; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III I I 1 1 II IMIMIIMI "1 1 1 II 1 1 1 1 II; 1 1 1 1 1 1 1 1 1 1 1 1 M I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 



Qy 


66 


Db 


897 


Qy 


126 


Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I llllllllllllll MINIM Mill llllllllllllll MINIUM 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

MIIIIIIIIMM MIIMIIMM Ml MMIIIMM IMIMIMM MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I M Ml MM M N M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M II II 1 1 1 1 1 1 Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll IMMMMMMI MINIM MM 1 1 1 1 1 1 N 1 1 1 II II I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill II Mill M I III .Ml MM M I III I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

-lllllll I II Mill I I Mill llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGC^TATATATGTTCTTCAAC^(^TCAG 12 56 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll llllllll II INN II llllllll Mill M III lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

IIIIIIMIIMIIIIIMI III IIIIIMII IMIMMMMIMM MINI 

GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATA(^G(^CAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

INI IIIIIIIIIIIIIIIIIIIIIII INN INN II II II I II INI 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 14 36 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

II I II 1 1 III 1 1 1 1 II I M I II I M; III III III II II INN lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 



I lllllll INI IIIIIIM lllllllllll II INN II lllllll 



TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I 1 1 1 1 1 1 1 . MIMM IMMMIMM MMMMMM lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 
TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I 1 1 II I MM M I M I M I! II I M I M I II 1 1 INN II 1 1 II I II II 1 1 1 II 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 9 05 

llllllll llllllll INI 

GCCCTGGACACCAACTATTGCT 1697 



Qy 9 06 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

! M 1 1 1 1 1 1 1 [ I 1 1 ! 1 1 1 1 1 MM! 1 1 1 1 1 1 M I IM Mill llllll 

Db 16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1757 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

I MM! 1 1 1 1 1 1 Mill MM MIMM lllllll 

Db 175 8 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCT 1817 

Qy 1026 GTCCCTAC^TCTGGAGCCTAGAC^CTCAGTACAGC^AGGTCCTGGCTCTGTACAACCAGC 108 5 

I MIMM MINIM INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM MM 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I MMM III II I M II II 1 1 II I MM 1 1 II M 1 1 1 1 II M 1 1 1 1 lllllll 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 193 7 

Qy 114 6 T CGTGTA CTA CGTGGG CCG CAAG C C CAAGGTGG AG CAG CTGTC CAA CATGAT CGTG CGTT 12 05 

! ! I ! M I ! ! I ! I ! 1 1 ! I ! ! | ! ! ! ! ! ! ! ! ! M 1 1 II ! | ! 1 1 1 ! ! ! I ! ! II ! 1 1 ! I ! I M I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

lllllllllllllllllll I 1 1 1 1 1 M 1 1 1 II I II I II INN I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

III INI III II II I III II MIMM III III MIMM II I 

Db 2 058 GCGGCCCCGCCCGCGGTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 1337 

I II I I Ml I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 4 
AAQ03301 

ID AAQ03301 standard; DNA; 2537 BP. 
XX 

AC AAQ033 01; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 

XX 

DE cDNA encoding human pre-transf orming growth factor-beta- 1 (pre-TGF-beta- 

DE 1) . 

XX 

KW Transforming growth factor-beta - 1 (TGF-beta-1) ; 

KW neoplastic cell line inhibition; 

KW EGF-potentiated anchorage- independent growth; 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2014 

FT /*tag= a 

FT mat_peptide 1676.. 2011 

FT /*tag= b 

FT misc difference 37.. 113 



FT 
FT 
FT 
FT 
FT 
FT 



/*tag= c 

/note="stable hairpin loops" 



misc feature 



2015. .2100 
/*tag= d 



/note="G-C rich sequence 
and a downstream TATA- 1 ike sequence" 



XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 87US-0025423 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR P-PSDB; AAR05258. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC It was obtained by an analysis of several overlapping cDNAs and gene 

CC fragments, leading to the detn. of a continuous sequence corresp. to the 

CC TGF-beta-1 precursor mRNA . It is useful in constructing vectors that 

CC encode biologically active transforming growth factor (TGF-beta) , 

CC operably linked to DNA that encodes a secretory leader (SL) . It, or a 

CC nucleic acid capable of hybridising with it, can also be labelled and 

CC used in diagnostic assays for DNA or mRNA encoding TGF-beta or related 

CC proteins. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 73.4%; Score 9 9 0.8; DB 11; Length 2537; 
Best Local Similarity 85.5%; Pred . No. 4.5e-193; 

Matches 1146; Conservative 0; Mismatches 147; Indels 47; Gaps 2; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 



Db 




Qy 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 





Db 



8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 9 



Qy 



126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 



Db 




Qy 



186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 


Qy 


906 


Db 


1698 


Qy 


966 


Db 


1758 


Qy 


1026 



I lllllllllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 305 

I II llllllll llllllllllllll IIMIIM I II I II 1 1 1 1 1 1 : 1 1 1 1 II 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I IIM'II 1 l : M II II llllll. llllll I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

Illlllllll II llllll I I MINI llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 12 56 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

Illllll IIMIIM II Mill II llllllll llllllllllllll Illllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GG CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCC^ 54 5 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M I II M II 1 1 1 II 1 1 1 II 1 1 1 II llllll 

GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM lllllllllllllllllllllll Mill Mill II llllll I II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 
ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 . 1 1 1 1 1 1 :[ 1 1 1 1 1 1 ! 1 1 1 1 : Ml 1 1 I III II II Mill Illllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 14 96 



I II Illllll MM llllllll IIIIIIMIII II Mill II Illllll 



TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I llllllll I I II I I I I I I I I M I : I I II llllllllllllll Illllll 
TCACTACCGGCCGCCGAGGTGACCTGGGCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I IIMIII'IIMIMIIIII II llllll Mill III II Mirillll 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA - 1675 
CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

llllllll llllllll MM 

GCCCTGGACACCAACTATTGCT 1697 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

' I II I M 1 1 1 1 1 1 M II I M 1 1 h 1 1 1 1 1 1 1 1 1 1; 1 1 1 1 1 1 1 1 1 1 1 1 1 Illllll 

TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 17 57 
TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

I I f 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 III I IIMIIM Illllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 
GTCCCTA(^TCTGGAGCCTAGACACTC^GTAC^G(^GGTCCTGGCTCTGTACAACCAGC 1085 

I MINI 1 1 1 1 M 1 1 Mill IIIMIIIIIIIIIIIIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



1818 GCCCCTA(^TTTGGAGCCTGGA(^CGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 18 77 



Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I Illllllllll IIIIIIIMIIIIIIIII lllllll IMMIIIII lllllll 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 II 1 1 1 1 II ; 1 1 1 1 M I M 1 1 1 1 M I M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II I > 1 1 1 1 1 i I I 

Db 193 8 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 65 

IIIIMIIIIIIIIIIIII MMIMIMM II I II I I I Mill I 

Db 19 98 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGGCCCGCCCCGCCCCGGCAGGCCGGGCCCCAC 2 0 57 

Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

Ml MM III M II IIIIIIIMIIIIIIIII IN. Ml 

Db 2 058 GCCGGGCCGCCCGCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCGCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 1337 

Mill MM I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 5 
AAQ02814 
ID 
XX 
AC 
XX 



DT 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



AAQ02814 standard; cDNA; 2537 BP. 
AAQ02814; 



25-MAR-2003 
31-OCT-2002 
31-MAY-1989 



(updated) 
(updated) 
(first entry) 



Sequence of pre-TGF-betal cDNA . 

Transforming growth factor beta -3 
inhibition . 



(TGF beta 3); tumour cells; growth 



Homo sapiens. 

Key 
CDS 



CDS 



GC_signal 



misc feature 



stem_loop 



misc feature 



Locat i on/Qua 1 if iers 
842. .2011 
/*tag= a 

/label=pre-TGF beta 1 
1677 . .2011 
/*tag= b 

/label=mature TGF-beta 1 
2015 . .2092 
/*tag= c 
2093 . .2099 
/*tag= d 

/label=TATA-like sequence 
37. .113 
/*tag= e 
863 . . 911 
/*tag= f 

/ label ^hydrophobic domain 



XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 8 8WO-US0 1 94 5 . 
XX 

PR 08-JUN-1988; 88WO-US01945 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 

DR P-PSDB; AAR04034. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta -3 -used as a 

PT probe, or to produce TGF beta 3, for qrowth inhibition of certain normal 

PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. lb; 61pp; English. 
XX 

CC Sequence encodes the 390 amino acid (AA) precursor transforming growth 

CC factor-beta 1 (pre-TGF-beta 1) polypeptide. The 5' untranslated region of 

CC the TGF-beta 1 mRNA is 841 bases long, is purine rich and has a region of 

CC potential secondary structure. The TATA -1 ike sequence in the 3' untrans- 

CC lated region of the gene is presumably a polyadenylat ion signal. Mature 

CC TGF-beta 1 comprises the C-terminal 112 AA ' s of pre-TGF-beta 1 and is 

CC cleaved at the Arg-Arg dipeptide preceding its NH2 terminus. The nucleic 

CC acid encoding the second subtype of TGF-beta (TGF-beta 3) is useful as a 

CC probe or to produce TGF-beta 3 for inhibition of growth of normal and 

CC neoplastic cells. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 

Query Match 73.4%; Score 990.8; DB 11; Length 2537; 

Best Local Similarity 85.5%; Pred . No. 4.5e-193; 

Matches 1146; Conservative 0; Mismatches 147; Indels 47; Gaps 2; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

M III I I I I II lllllllllll I II I II I II Ml II I I I I I I I I I I I I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTG CTGA CG C CTGG C CGG CCGG CCG CCGGACTGT CCA C CTG CAAGACCAT CGA CATGG 125 

I I M 1 1 1 1 1 1 1 1 1 1 II III Mill llllllllllllll llllllllll 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGA CATGG 956 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

ll ; I 'I lllllllllll lllllllllll I I I I 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I IIIIIIIIIIIIIIIIIIIIIII IMIIIIIIIIMIIIMM Mill II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 



Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II IIIIIIII IIIIIIIIIIIIII IIIIIIII MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


1077 


1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

MMMMII II IIIIII 1 1 IIIIII IIIIIIII 1 IIIIIIII II 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


425 


Db 


1197 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

IIIIIII MMM II Mill II MINIM IIMIIMIIIIII IIMIII 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


1257 


1316 


Qy 
Db 


486 
1317 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 

IIIIIIIIIII 1 1 M 1 1 1 1 1 MMM III III 1 IM MMM 

GGCTC^AGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 
1376 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM MMMMMMMMMMMI Mill Mill 1 1 MMM 1 II MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1377 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

II 1 M 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 III III III II II Mill IIIIIII 
ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 


725 


Db 


1497 


1 II IIIIIII MM IIIIIIII IIIIIIIIIII II Mill II IIIIIII 

TTAGCGCCGACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


1556 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml 1 1 IIIIIIII IIIIIMIIIIIMIIIMI IIIIIIIIIIIIII IIIIIII 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

i 1 1 m ii i m i m ii i ii i m ii 1 1 1 ii 1 1 m Mm 1 1 1 r i r 1 1 1 1 1 1 1 1 1 r 1 1 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1617 


1675 


Qy 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

IIIIIIII IIIIIIII MM 
GCCCTGGACACCAACTATTGCT 


905 


Db 


1676 


1697 


Qy 

Db 


906 
1698 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIIIIIIIII IIIIIII 
1 II 1 I II 1 1 1 1 1 1 I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I I I I I ; IIIIII 

TCAG CTC CA CGGAGAAGAACTG CTG CGTG CGG CAG CTG TA CATTGACTT C CG CAAGGA CC 


965 
1757 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 M 1 1 1 1 1 1 1 II II 1 II II II 1 II MIM IM IIIIII 1 IIIIIIII IIIIIII 

TCGG CTGGAAGTGGAT C CA CGAG C C CAAGGG CTACCATG C CAA CTT CTG C CT CGGGCC CT 


1025 


Db 


1758 


1817 


Qy 
Db 


1026 
1818 


GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 

1 IIIIIIII MINIM Mill II Mill IMIIMM IM 1 1 1 1 II M 1 1 II M 

GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 


1085 
1877 



Qy 108 6 ACAAC C CGGG CG CGT CGG CGG CG CCGTG CTG CGTG CCG CAGG CG CTGGAG C CACTG CC CA 114 5 

I 11:11 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II ! M 1 : 1 1 II I II 1 1 1 1 1 li MINI 

Db 18 78 ATAA C CCGGG CG CCT CGG CGG CG CCGTG CTG CGTG CCG CAGG CG CTGGAG C CG CTG CC CA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGTT 12 05 

IIIIMIMI IIIIIIIIIMIIIIIIIIIIIIIIMII II MINI lllllil I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCC^GGTGGAGC^GCTGTCC^ACATGATCGTGCGCT 1997 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMIMMI MINIM MMI I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 

Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

Ml MM Ml II II lllllllllll II III lllllllllll 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 C C CACTTGGG AT CG ATTAAA 1337 



Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 6 
AAT15720 

ID AAT15720 standard; cDNA; 2537 BP. 
XX 

AC AAT15720; 
XX 

DT 25-MAR-2003 (updated) 

DT 24-JUL-1997 (revised) 

DT 25-JAN-1980 (first entry) 
XX 

DE Pre-transforming growth factor beta 1 cDNA . 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production; ss. 



XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT 5 'UTR 1 . .841 

FT /*tag= a 

FT misc_feature 37.. 113 
FT /*tag= b 

FT /note= "GC-rich region forms stable hairpin loops; 

FT similar to structural organisation of c-myc RNA, 

FT could play role in mRNA stability or in 

FT regulation of transcription" 

FT CDS 842 . .2014 

FT /*tag= c 

FT /product= pre-TGF_beta_l 

FT mat_peptide 1676.. 2011 

FT /*tag= d 

FT /product = mature_TGF_beta_l 

FT repeat_region 2015.. 2100 
FT /*tag= e 

FT /note= "GC-rich region; possibly responsible for the 

FT fact 3'UTR of mRNA could not be cloned as cDNA; 



FT may be important for transcription efficiency" 

FT repeat_unit 2019.. 2023 

FT /*tag= f 

FT TATA_signal 2094.. 2100 

FT /*tag= g 

FT /note= "TATA- like sequence; no evidence that this 

FT functions a promoter" 

FT polyA_signal 2514 . .2520 

FT /*tag= h 

FT misc_signal 2529.. 2536 

FT /*tag= i 

FT /note= "consensus sequence immediately precedes 

FT polyA-tail (Benoist et al)" 

XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US- 0147364 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 

PR 22-MAR-1985; 85US-0715142 . 

PR 04-AUG-1989; 8 9US- 0389929 . 

PR 04-MAR-1992; 92US- 084 58 93 . 

PR 05-NOV-1993; 93US - 0 14 7364 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 

DR P-PSDB; AAR90827. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 

PT using Chinese hamster ovary cells, for use in diagnostic applications 

PT or in therapy 
XX 

PS Example 3; Fig 1; 26pp; English. 
XX 

CC The cDNA encodes the pre- transforming growth factor (TGF) beta 1 protein. 

CC The nucleotide sequence was obtd. by an analysis of several overlapping 

CC cDNAs and gene fragments. The DNA is useful for the recombinant 

CC production of TGF beta 1, which can be used in, e.g. wound healing. 

CC (Revised entry submitted to correct sequence analysis breakdown.) 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other,- 

Query Match 73.4%; Score 990.8; DB 17; Length 2537; 

Best Local Similarity 85.5%; Pred. No. 4.5e-193; 

Matches 1146; Conservative 0; Mismatches 147; Indels 47; Gaps 2; 

Qy 6 C CGAGATGG CG C CTT CGGGG CTG CGG CTCTTG C CG CTG CTG CTG C CG CTG CTGTGG CTG C 6 5 

II III I I I I II lllllllllll lllllllllllll llilllllllllll I 
Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 



Db 


897 


Qy 


126 


Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 


Db 


1497 


Qy 


726 


Db 


1557 


Qy 


786 


Db 


1617 


Qy 


846 


Db 


1676 


Qy 


906 



I llllllllllllllll lllllllll Mill llllllllllllll llllllllll 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

1 1 II 1 1 . 1 1 1 1 1 M 1 1 1 1 1'l I !! I II 1 1 ! lllllllllll II I M II II 1 1 1 1 1 1 

AG CTGGTGAAG CGGAAG CG CATCGAGG C CATCCG CGG C CAGAT CCTGT CCAAG CTG CGG C 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I lllllllllllllllllllllll llllllllllllllllllll Mill 1 1 II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGGCCGAGC 3 05 

I II llllllll llllllllllllll MINIM MM II I ' M : 1 1 1 1 1 1 1 1 

CCCTGTA CAA CAG CA CC CG CGAC CGGGTGG C CGGGGAGAGTG CAGAAC CGG AG CC CG AG C 1136 
CAGAGGCGGACTACTACGCGAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I I I M I I I I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTC^GGGCACCCCC(^(^GCTTATATATGCTGTTC^ACACGTCGG 4 2 5 

! 1 1 1 1 1 1 1 1 1 II MUM I I MINI llllllll I llllllll II I 

AAATCTATGACAAGTTCAAGC^GAGTAC^CA 12 56 

AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

lllllll llllllll II Mill II MINIM I r 1 J 1 1 1 1 1 1 1 1 [ I lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTC^GTTAAAAGTGGAGCAGC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

M IIIIIIML! Irll Mill II M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II llllll 

GGCTCAAGTTAAAAGTGGAGC^GC^CGTGGAGCTGTACC^GAAATAC^GC^CAATTCCT 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

MM llllllllll IIIMIMM Mill Mill II llllll I M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1436 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml III III II II Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 
TCAGTGCCCACTCTTCCTCTGAC^GawVGATAACAC^CTCC^CGTGGAAATTAACGGGT 

I M lllllll MM llllllll IMIMIMM M Mill M lllllll 

TTAGCGCCC^CTGCTCCTGTGAC^GC^GGGATAACAC^CTGC^GTGGAC^TCAACGGGT 
TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

in i i Eiiiiiii r 1 1 f 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 'i 1 1 1 mi 1 1 1 1 iiiiiii 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 
TCCTC^TGGCC^CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

I M II I II I M II 1 1 II I II 1 1 II II M II 1 1 I Mill MMMMMIIMIMI 



llllllll IMMIII MM 

- GCCCTGGACACCAACTATTGCT 16 97 



IMMIIMMMMIMIMMMIMIMMIMI llllllllllllll Mill 



Db 



16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 17 57 



Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

I MIIIM III II II II 1 1 II 1 1 1 ! 1 1 1 1 M Ml 1 1 I i 1 1 1 1 1 ; lllllll 

Db 175 8 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

I llllllll III! Mill Ml lllllll ilh III 
Db 1818 GCCCCTACA.TTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I Illllllllll IIMMMIMM MMIIMMMIIMI II III lllllll 
Db 1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1: lllllll 1 1 1 ; II il I II I ii mm ii mi Mini mmi mi ill ,11 il I i 

Db 193 8 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

lllllllllllllllllll M I MM llllllll Mill I 

Db 19 98 CCTGCAAGTGCAGCTGAGGTCCCGCGCCGGCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

III MM III II II lllllllllllllllllll Illllllllll 

Db 2 058 CCCGCCCCGCCCGCGCTGCGTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 13 37 

Mill MM I I II 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 7 
AAQ56923 

ID AAQ56923 standard; cDNA; 2537 BP. 
XX 

AC AAQ56923; 
XX 

DT 25-MAR-2003 (updated) 

DT 09- JUL- 1994 (first entry) 

XX 

DE Human pre-TGF-beta - 1 . 
XX 

KW TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 

KW transforming growth factor beta-3; recombinant; wound healing; 

KW vulnerary; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT misc_structure 47.. 113 
FT /*tag= a 

FT /note= "possible hairpin loop region" 

FT CDS 842.. 2014 

FT /*tag= b 

FT mat_peptide 1676.. 2011 

FT /*tag= c 

FT polyA_signal 2515.. 2521 



FT /*tag= d 

XX 

PN US5284763-A. 
XX 

PD 08-FEB-1994. 
XX 

PF 04-MAR-1992; 92US - 084 58 93 . 
XX 

PR 22-MAR-1985; 85US - 0715142 . 

PR 13-MAR-1987; 87US - 0025423 . 

PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US - 084 58 93 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynk RMA, Goeddel DV; 
XX 

DR WPI; 1994-056343/07. 

DR P-PSDB; AAR46227. 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig lb; 25pp; English. 
XX 

CC cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923) , 

CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 

CC corresponding amino acid sequences were determined (AAR46227 -29 , 

CC respectively) . A genomic fragment corresponding to a human TGF- 

CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 

CC determined (AAR4 623 0) . The sequences have been used in the 

CC construction of vectors for the expression of recombinant TGF- 

CC beta. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 890 C; 742 G; 432 T; 0 other; 



Query Match 73.3%; Score 989.2; DB 15; Length 2537; 

Best Local Similarity 85.4%; Pred. No. 9.6e-193; 

Matches 114 5; Conservative 0; Mismatches 148; Indels 47; Gaps 2; 



Qy 


6 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II IN Mil II lllllllllll lllllilllllll Mllllllllllll 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


65 


Db 


837 


896 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 llllllllllllllll IMIIIIII Mill llllilllllllll Mllllllll 

TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


125 


Db 


897 


956 


Qy 


126 


AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 

IMIIMIIIIIIIIIIIIMMMIIIMI lllllllllll lllllllllll MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


185 


Db 


957 


1016 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 II 1 1 1 1 M 1 1 II M 1 1 1 M 1 II 1 1 1 1 1 1 ! 1 1 1 1 1 . 1 1 1 1 1 1 1 1 Mill II II 1 

TCG CCAG C C C C C CG AG C CAGGGGG AGGTG CCG C C CGG C C CG CTG C C CG AGG C CGTG CT CG 


245 


Db 


1017 


1076 



Qy 
Db 


246 
1077 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll llllllllllllll llllllll MM MM 1 1 1 1 1 1 1 1 1 1 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGC 


305 
1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAAGG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 


425 


Db 


1197 


MMMMM M MMM 1 1 MMM MMIMI 1 MMMM M 1 

AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGGCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll llllllll II Mill II llllllll MMM, MM h MIMM 

AGCTCCGAGAAGGGGTACCTGAAGCCGTGTTGCTGTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


1257 


1316 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 
1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMM 
GGCTCAAGTTAAAAGTGGAGCAGCAGGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


1317 


1376 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


1377 


MM IMMMMMMMMMMM Mill IMM II MMM 1 M MM 

GGGGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 1 1 [ 1 If 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Ml III II M Mill lllllll 

ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II MIMM MM MMMM M MM, II Mill M MIMM 

TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 


725 


Db 


1497 


1556 


Qy 


726 


TCAATTCTGGGGGCCGGGGTGACCTGGGCACCATTCACGGCATGAACCGGGCCTTCCTGC 

III 1 1 MMMM ' MMM MMMMM i Ml IMIMMMMM lllllll 

TCACTACCGGCCGGCGAGGTGACCTGGGCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGGCCAGCACCTGCACAGGTCCCGGCACCGCCGAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 
TTGTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACGGCCGA- 


845 


Db 


1617 


1675 


Qy 


846 


CCGTGGATAGCAACGACTACAAGGATGAGGACGACAAGGCCGTGGATACCAACTACTGCT 


905 


Db 


1676 


MMMM MMMM MM 

GCCCTGGACACCAACTATTGCT 


1697 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 
M 1 1 1 II 1 1 M 1 1 1 II 1 II 1 1 1 1 1 1 II 1 1 II 1 II 1 1 1 1 1 1 j 1 1 1 1 1 1 1 ! 1 1 lllllll 

1 \J\\j\^ 1 LCJALLjtjAbAAbAAL 1 bL 1 I bLbOUALjL T OTACAT 1 CjALTTCCGLAACjCjALL 


965 


UD 


lb y o 


1757 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 llllllllllllll II M I 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 llllllll lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1025 


Db 


1758 


1817 


Qy 


1026 


GTCCCTACATCTGGAGCCTAGACACTCAGT^ 

1 MMMM llllllll Mill 1 r 1 1 1 1 1 f 1 1 F 1 1 1 1 1 1 1 1 1 M MMMMM 

GCCCCTACATTTGGAGCCTGGAC^CGCAGTAC^GC^AGGTCCTGGCCCTGTACAACCAGC 


1085 


Db 


1818 


1877 



Qy 



1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 



I MINIMI I I I M 1 1 1 1 MINI Ml Ill I III 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 M 1 1 1 1 Ml III 1 1 1 1 II II 1 1 M 1 1 II 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 MM 1 1 1 1 I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 



Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

IIIIIMIIIIIIIIIIII I II I II I I I Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 0 57 



Qy 1266 CCCACGCCCGCCCGCCT CA C CGGGG CTGT ATTT AAGG A CAT CGTG C C CCAAG 1317 

III 1 1 1 1 III 1 1 II MINI II MUM 

Db 2 05 8 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 1337 

INN I I I I I I I I 
Db 2118 CCCACCTGGGGCCCCATTAA 2137 



RESULT 8 
AAV52933 

ID AAV52933 Standard; cDNA; 2537 BP. 
XX 

AC AAV52 933; 
XX 

DT 25-MAR-2003 (updated) 

DT 21-DEC-1998 (first entry) 

XX 



DE Human pre- transforming growth factor-beta 1 cDNA. 
XX 

KW Transforming growth factor-beta 1; TGF-beta 1; human; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842.. 2014 

FT /*tag= a 

FT mat_peptide 1676.. 2011 

FT /*tag= b 

FT stem_loop 37.. 113 

FT /*tag= b 

FT /note= "putative stable hairpin loop" 

FT misc__f eature 2015.. 2100 
FT /*tag= c 

FT /note= "GC-rich sequence" 

FT polyA_signal 2514.. 2520 
FT /*tag= d 

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US - 04544 68 . 
XX 

PR 13-MAR-1987; 87US - 0025423 . 
PR 22-MAR-1985 ; 8 5US - 07 15 142 . 



PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US - 084 58 93 . 

PR 05-NOV-1993; 93US - 014 73 64 . 

PR 30-MAY-1995; 95US-0454468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR P-PSDB; AAW78785 . 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This nucleotide sequence codes for the human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1, see AAW78785) . It is a 

CC composite of overlapping cDNA clones isolated from different cDNA 

CC libraries (placenta, A172 glioblastoma, HT1080 f ibroblastoma) using 

CC TGF-beta exon (see AAV52936) restriction fragments as probes. 

CC The 3' region of the sequence was determined using cloned genomic 

CC DNA . The invention relates to the recombinant production of 

CC TGF-beta. Biologically active TGF-beta is defined as being capable 

CC of inducing EGF-potentiated anchorage independent growth of target 

CC cell lines and/or growth inhibition of neoplastic cell lines. 

CC Nucleic acids encoding TGF-beta have been isolated and cloned into 

CC vectors which are replicated in bacteria and expressed in 

CC eukaryotic cells. TGF-beta recovered from transformed cells is 

CC used in known therapeutic applications. TGF-beta nucleic acids are 

CC also useful in diagnosis and identification of TGF-beta clones. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 475 A; 895 C; 736 G; 431 T; 0 other; 



Query Match 73.1%; Score 986; DB 19; Length 2537; 

Best Local Similarity 85.3%; Pred. No. 4.3e-192; 

Matches 1143; Conservative 0; Mismatches 150; Indels 47; Gaps 2; 



Qy 
Db 


6 

837 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II Ml MM M II Mill lllllll! 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


65 
896 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 III 1 1 1 1 1 i 1 1 1 1 1 MIMM 1 Mill IIIIIIIIIIIIII llllllllll 

TGGTGCTGACGCCTGGCCCGCCGGCCCCGGGACTATCCACCTGCAAGACTATCGACATGG 


125 


Db 


897 


956 


Qy 


126 


AG CTGG TG AAG CGGAAG CG CAT CGAGG C CATT CG CGG C CAGATT CTGT CCAAG CTTCGGC 

Ml 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 ! j IMIIIIIMI IIIIIIIIIII MM 

AGCAGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 


185 


Db 


957 


1016 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II II II IMIIIIIIIIIMIIIIII Mill II II 1 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


1017 


1076 



Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II llllllll llllllllllllll llllllll MM II 1 1 1 M 1 I 1 1 1 1 1 1 1 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


305 


Db 


1077 


1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 llllll . 1 1 1 1 1 M 1 1 1 1 1 1 1 1 MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

llllllllll II MUM 1 1 llllll llllllll 1 IMIIMI M 1 

AAATCTATGACAAGTTCAAGC^GAGTACACACAGCATATATATGTTCTTCAACACATCAG 


425 


Db 


1197 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll IMIIMI II Mill II llllllll llllllllllllll lllllll 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


1257 


1316 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 
1 1 1 1 II 1 II II 1 II 1 II 1 1 1 II 1 II II II M II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 llllll 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


545 


Db 


1317 


1376 


Qy 


546 


GGCGCTAGCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

MM MIIIIIIIIIIMIIIMIIII Mill Mill II llllll 1 M MM 

GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 


Db 


1377 


1436 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 

1 1 1 1 1 II II 1 1 II 1 1 1 1 II II II 1 II III III III 1 II Mill lllllll 
ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGAGAAATTGAGGGCTTTCGCC 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCCCACTGTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 

1 II lllllll MM llllllll lllllllllll II Mill II lllllll 

TTAG CG C CCA CTG CTC CTGTGACAG CAGGGATAACA CA CTG CAAGTGGA CATCAACGGGT 


725 


Db 


1497 


1556 


Qy 


726 


TO^TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Ml 1 1 llllllll lllllllllll lllllll llllllllllllll lllllll 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCT(^TGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 M 1 1 1 1 1 M 1 II M 1 1 II II 1 M 1 1 1 1 1 1 1 1 Mill IMIIM llllllllll 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 


845 


Db 


1617 


1675 


Qy 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 

llllllll llllllll MM 


905 


Db 


1676 


1697 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

lllllllllllllllllllllllllllllllllllll llllllllllllll lllllll 


965 


Db 


1698 


1 TCT 

l /b / 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

1 llllllllllllll II II 1 II L TIM 1: lllllll 1 llllllll lllllll 

TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 


1025 


Db 


1758 


1817 


Qy 


1026 


GTCCCTACATCTGGAGCCTAGACACTCAGT^ 

1 IMIIMI IMIIMI Mill MM II llllllllll 1 MIMMMMM 

GCCCCTA(^TTTGGAGCCTGGAC^CG(^GTAC^GCAAGGTCCTGGCCCTGTAa^CCAGC 


1085 


Db 


1818 


1877 



Qy 



108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 



i iiiiiiiiiii Miriiiii ii iiiiiiiiiMiiiiiii'ii i iiiiiii 

Db 1878 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCy\AGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

Ml I MINI Ml I l.llllll I II llllllll Ml I MM llllll IIIIMI III I I 

Db 193 8 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 19 97 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

M . Ml II Ml I llllllllllll II llllll I M II I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2057 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

Ml I II I Ml II I I I ' 1 1 1 1 ! 1 1 1 1 1 1 i 1 1 1 1 lllllllllll 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCAAG 2117 

Qy 1318 CCCACTTGGGATCGATTAAA 1337 

lllll MM I I M 

Db 2118 CCCACCTGGGGCCCCATTAA 2137 

RESULT 9 
AAQ03268 

ID AAQ03268 standard; DNA; 1561 BP. 
XX 

AC AAQ03268; 
XX 

DT 25-MAR-2003 (updated) 

DT 12-AUG-1990 (first entry) 

XX 

DE Simian transforming growth factor-beta cDNA. 
XX 

KW Transforming growth factor-beta; psoriasis; TGF-beta; ss . 
XX 

OS Monkey. 
XX 

FH Key Location/Qualifiers 
FT sig_peptide 283.. 324 

FT /*tag= a 

FT mat_peptide 1096.. 1431 

FT /*tag= b 

FT /product =human transforming growth factor-beta 

XX 

PN EP353772-A. 
XX 

PD 07-FEB-1990. 
XX 

PF 04-AUG-1989; 8 9EP- 01 144 58 . 
XX 

PR 05-AUG-1988; 88US- 022 9133 . 
XX 

PA (ON CO ) ONCOGEN LP. 
XX 

PI Twardzik DR, Purchio AF, Ranchalis JE, Stevens V; 
XX 

DR WPI; 1990-038499/06. 
DR P-PSDB; AAR03743. 
XX 



PT Inhibition of proliferation of epidermal cells - 

PT used to treat psoriasis by contacting cells with compositions 

PT containing transforming growth factor-beta. 

XX 

PS Disclosure; fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis-categorised skin diseases/ and 

CC in alleviating psoriatic symptoms associated with cytokme- 

CC induced phenomena. See also AAQ032 6 9 and AAR0375 0. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other; 

Query Match 72.9%; Score 983; DB 11; Length 1561; 

Best Local Similarity 85.6%; Pred . No. 1.6e-191; 

Matches 1148; Conservative 0; Mismatches 145; Indels 48; Gaps 
!CGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

I III 1 1 1 1 II MINIMI 1 1 1 1 1 1 1 1 1 f I ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 



TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMI 

TGGTGCTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGG 376 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

M 1 II I II M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM I M I M M II I IIIIIIMMI MM 

AGCTGGTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 43 6 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I IIIIIIIIIIMIIIIIMIMI IMIIIIIIIIIIIIIIIII Mill M II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 4 96 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II Illlllll 1 1 1 1 1 1 1 1 1 E I r [ I Ml M MM M MIIMIMM I 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAAC 556 
C^GAGGCGGACTACTACGCC^GGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I MIM 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM 

CGG AGG C CG A CTA CTA CG C CAAGGAGGT CA C C CG CGTG CTAATGGTGG AAA CC CA CAA CG 616 



IMMMMI II MUM I I I llllll Illlllll I Illlllll II I 



AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

MIMM MIM II M 1 1 1 1 1 1 1 1 Illlllll MIIIIIIIMIII M M 

AGCTCCGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 736 

GGCTCAAGTTAAAAGTGGAGC^GC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

M M I II I M 1 1 II II M M I M M II II II M 1 1 1 M II II 1 1 1 1 1 1 1 1 MUM 

GGCTC^GTTAAAAGTGGAGC^GCATGTGGAGCTGTACCAGAAATAC^GCAAC^TTCCT 7 96 



MM I i 1 1 1 1 1 1 1 II ! 1 1 1 1 1 1 1 M 1 1 Mill MM ! 1 1 1 1 1 1 1 1 MM MM 



Qy 


6 


Db 


257 


Qy 


66 


Db 


317 


Qy 


126 


Db 


377 


Qy 


186 


Db 


437 


Qy 


246 


Db 


497 


Qy 


306 


Db 


557 


Qy 


366 


Db 


617 


Qy 


426 


Db 


677 


Qy 


486 


Db 


737 


Qy 


546 


Db 


797 



Qy 6 06 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

II II III I MINI I IMIMI II II II Ml II Ml II II Mill II II III 

Db 857 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCC 916 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 725 

I II IMIMI MM II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 II Mill II IMIMI 

Db 917 TTAGCGCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGT 976 

Qy 7 26 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I 1 1 1 1 f 1 1 1 MMMIMIMM Mill I I ' I . 1 1 1 1 1 IMIMI 

Db 977 TCACTACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGC 1036 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I IMIMI Mill 1 1 1 1 , 1 1 1 1 ! 1 1 1 1 II Mill MMMMIMMIMM 

Db 1037 TTCTCATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA" 1095 

Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

IIIIIMI llllllllllll 
Db 10 96 GCCCTGGACACCAACTACTGCT 1117 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

M 1 1 II M 1 1 II M 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 II IIIIMIIIM IMIMI 

Db 1118 TCAGCTCCACGGAGAAGAACTGGTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACC 1177 

Qy 966 TGGG CTGGAAGTGGATT CATGAAC CCAAGGG CTAC CATG CCAATTT CTG C CTGGGG CC CT 1025 

I 'IMIMI M II MMIIIMIIMI MM IMIMI 

Db 1178 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCT 1237 

Qy 1026 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

1 1 1 1 1 1 1 1 1 1 IIIIIMI Mill MM IIIIIMI MUM 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1238 GTCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 12 97 

Qy 1086 ACAACCGGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I 1 1 1 i 1 1 1 1 [ 1 1 IIIIIMIIIIIIMIIMIIIIIIMIIIMIMIIIIIIMIII 

Db 12 98 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 13 57 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

MM III IMIII III I 1 1 1 MIMIIII II III Mill Mill MIMI MM I I 

Db 1358 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1417 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCGCGGC 1265 

IMIMI 1 1 1 1 1 1 II I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II I I I Mill II 

Db 1418 CCTGCAAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCGGGCCCCGC 1477 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 1316 

IIIIIMI III II I 1 . 1 1 1 1 ! 1 1 1 1 1 ' 1 1 1 1 llllllllll 

Db 14 78 CCCACCCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 1537 

Qy 1317 GCCCACTTGGGATCGATTAAA 1337 

MIMI MM I I II 

Db 153 8 GCCCACCTGGGGCCCCATTAA 1558 



RESULT 10 
AAI58342 

ID AAI58342 standard; cDNA; 2742 BP. 



XX 

AC AAI58342; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 545. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia; ss. 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2000WO-US34263 . 
XX 

PR 21-JAN-2000; 2000US- 048872 5 . 

PR 25-APR-2000; 2000US- 0552317 . 

PR 09-JUL-2000; 2000US-0598042 . 

PR 19-JUL-2000; 2000US-0620312 . 

PR 03-AUG-2000; 2000US- 06534 50 . 

PR 14-SEP-2000; 2000US-0662191 . 

PR 19-OCT-2000; 2000US-0693036. 

PR 29-NOV-2000; 2 000US-0727344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y , Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C, Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR P-PSDB; AAM39186. 
XX 

PT Novel nucleic acids and polypeptides; useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Claim 1; SEQ ID NO 545; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI 57798 -AAI61369) and 

CC the encoded polypeptides (AAM38642 -AAM42213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotact ic/chemokmet ic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 



CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 2742 BP; 526 A; 938 C; 799 G; 479 T; 0 other; 

Query Match 72.8%; Score 981.4; DB 22; Length 2742; 

Best Local Similarity 85.5%; Pred . No. 3.8e-191; 

Matches 1147; Conservative 0; Mismatches 146; Indels 48; Gaps 3; 

CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III 1 1 1 1 II III MM MMMMMMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 
TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 ' 1 1 1 1 1 M , 1 1 1 1 1 1 1 1 Mill llllllllllllll MINIUM 

TGGTG CTGA CG CCTGG CCGG C CGG CCG CGGGA CTAT C CACCTG CAAGA CTAT CGACATGG 9 56 
AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

Ml MM MIMM IMIMMI MMMM IMIMMIM IIMMIMM MM 

AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 
TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I IMIIMMIMMIMMMII MMMMMMI I II Mill II II I 

TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 
CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MMMM 1 1 1 1 1 1 1 ! 1 1 1 1 : 1 MMMM MM II 1 1 1 1 II 1 1 1 1 1 1 1 1 

CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 
CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill I M M M MM M M M M M I M MM II I M I Ml M M 1 1 I MM 

CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 
AAATCTATGATAAATTCAAGGGCACCCCCCAC^^ 425 

MMMM II MUM I I llllll MMMM I MMMM II I 

AAATCTATGACAAGTTCAAGCAGAGTAC^ 1256 
AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 85 

MIMM MMMM M Mill M MMMM llllllllllllll 1 1 1 1 1 1 1 

AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 
GGCTC^GTTAAAAGTGGAGC^GC^CGTGGAGCTATACC^GAAATACAGCAATGATTCCT 54 5 

III MMMIMI MMMM 1 1 MM 1 1 M M I II I II II M I II M II llllll 

GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTAC(^GAAATACA.GCAA(^TTCC^ 1376 
GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

MM 1 1 1 1 1 1 1 1 ' M 1 1 1 1 1 II II 1 1 1 Mill Mill || MUM I II MM 



Qy 


6 


Db 


837 


Qy 


66 


Db 


897 


Qy 


126 


Db 


957 


Qy 


186 


Db 


1017 


Qy 


246 


Db 


1077 


Qy 


306 


Db 


1137 


Qy 


366 


Db 


1197 


Qy 


426 


Db 


1257 


Qy 


486 


Db 


1317 


Qy 


546 


Db 


1377 


Qy 


606 


Db 


1437 


Qy 


666 



I II MIMM I II I MMMM MIMIIIIII II Mill II MIMM 



Db 



14 97 TTAGCGCCO^CTGCTCCTGTGACAGC^GGGATAACACACTGCAAGTGGACATC^CGGGT 



1556 



Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I llllllll illlllll! llllllllllllll lllllll 
Db 1557 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 1616 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I IN lllllll I Ml! MM I Mill I II IIIIH III I III II 

Db 1617 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 1675 

Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

llllllll llllllll I I I I 
Db 1676 GCCCTGGACACCAACTATTGCT 1697 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 1 1 1 1 II MM 1 1 1 1 1 MM 1 1 1 III 1 1 1 1 1 1 1 1 llllllllllllll lllllll 

Db 16 98 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 17 57 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I llllllllllllll II II Mill MIIMIMM llllllll lllllll 

Db 1758 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 1026 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 1085 

I llllllll llllllll Mill i i 1 : 1 1 1 1 1 1 1 llllllll IIIIIIMIIMI 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 1086 ACAA C CCGGG CG CGT CGG CGG CG C CGTG CTG CGTGC CG CAGG CG CTGGAG C CACTG C C CA 114 5 

I !M Ml MM III II I IMIMMMI IMIIMI III M lllllll 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

I MM MIMI I M Ml MM M Mill MMM III II I MUM I I Ml I M 1 1 I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1997 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 12 65 

II MMMMMIMM IIIIIIMIMI llllllll Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 2 057 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAA 1316 

Ml 1 1 1 1 III II II IIIIIIIIMIIMIIMI llllllll I 

Db 2 058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 2117 

Qy 1317 G CC CA CTTGGG AT CGATTAAA 1337 

MIMI 1 1 1 1 I I II 

Db 2118 GCCCACCTGGGGCCCCATTAA 2138 

RESULT 11 
AAQ2 02 8 9 

ID AAQ20289 standard; cDNA; 1559 BP. 
XX 

AC AAQ2 02 8 9; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 

DE Sequence encoding simian transforming growth factor (TGF) beta-1. 



XX 






KW 


nyptri LCHo -HJil 


UllfcrXcipy, IiypULcJlb J. Vc aycIlL , Jji.vJULl picbbUL C IUCJCJU.1 d LUI , 


KW 


Q Q 

iD O . 




A. A. 






OS 


l'ikji ifvc y . 










FH 


Key 


T .or* A Hnn/ Oi 1^1 "i f i pre; 

LjKJ \^ d L lull/ yUdl 1 1. 1CL o 


FT 
r i 


L-JJo 


-y (Z 9 n o 9 


r 1 




/ w uag= a 


r i 




9 a o 7 94 


PT 

F 1 




/ Lay- JJ 


PT 




o O cr i noo 
jZj . . lU^O 


PT 
r i 




/ ^ay— c 


PT 


Ilia L p ep LlUc 


lu^y . . i4Jb 


PT 
r i 




/ oay — LA 


YY 
AA 






PM 


W09119513-A. 




Y Y 
AA 






pn 


26-DEC-1991 . 




YY 

AA 






PF 


20-JUN-1991; 


91WO-US04449 . 


XX 






PR 


20-JUN-1990; 


90US-0541221 . 


XX 






PA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


XX 






PT 


Oleson FB, Comereski CR; 


XX 






LJi\ 


WPI; 1992-024199/03 . 


DR 


P-PSDB; AAR20124. 


XX 






PT 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists - 


for modulating blood pressure, for treating 


PT 


hypertension and hypotension 


XX 






PS 


Disclosure; Fig 1; 42pp; English. 


XY 

AA 






HP 


A new method 


for treating hypertension comprises administering a 


rc 


transforming growth factor (TGF) -beta to an individual at a dose 


cc 


effective for 


lowering blood pressure; the TGF-beta may be e.g. 


cc 


mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 


cc 


betal precursor, a latent TGF-beta2 precursor, hybrid TGF-betal/TGF- 


cc 


beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 


cc 


complex . 




cc 


(Updated on 25-MAR-2003 to correct PA field.) 


XX 






SQ 


Sequence 1559 


BP; 300 A; 546 C; 446 G; 267 T; 0 Other; 



Query Match 72.7%; Score 980.6; DB 13; Length 1559; 

Best Local Similarity 85.7%; Pred . No. 5.1e-191; 

Matches 1144; Conservative 0; Mismatches 144; Indels 47; Gaps 3; 
Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 7 0 

III 1 1 1 1 II lllllllllll lllllllllllll IIIIIIIIIIMM II III 

Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 



Qy 



71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 



Db 


321 


Qy 


131 


Db 


381 


Qy 


191 


Db 


441 


Qy 


251 


Db 


501 


Qy 


311 


Db 


561 


Qy 


371 


Db 


621 


Qy 


431 


Db 


681 


Qy 


491 


Db 


741 


Qy 


551 


Db 


801 


Qy 


611 


Db 


861 


Qy 


671 


Db 


921 


Qy 


731 


Db 


981 


Qy 


791 


Db 


1041 


Qy 


851 


Db 


1095 


Qy 


911 



lllllllll 1 1 II 1 1 1 1 1 1 1 1 1 Mill MINIMUM 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 

CTGA CG C CTAG CCGG CCGG C CG CAGGA CT AT C CA C CTG CAAG A CTAT CG A CATGGAG CTG 38 0 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

1 1 1 1 . 1 1 1 1 1 1 1! 1 1 1 M 1 1 MM MINIMI! lllllllllll Mill III 

GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 250 

1 1 1 1 1 1 j i j 1 1 1 1 1 f 1 1 f 1 1 f 1 1 1 1 r i f 1 1 1 1 1 1 1 1 1 1 1 1 mil ii m ii ii 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

IMIIIII IIIIIMIIIIIII Mllllll MM II N INN M III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 56 0 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 37 0 

II I . I NIN I II N 1 1 1 N N I II I Nil 1 1 II I N I II II N I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 4 3 0 

Mill II IIIIM I I I llllll Mllllll I Mllllll M MUM 

TATGACAAGTTCAAGCAGAGCACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 68 0 
CGGGAAG CGGTG CCGGAAC CTGTATTG CTCTCT CGGG CAGAG CTG CGC CTG CTGAGGCT C 4 90 

II Mill II II Mllllll Mllllll M M 1 1 1 1 1 1 1 1 1 ! MIIIMIIMI 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAAAAGTGGAG(^G(^CGTGGAGCTATACC^GAAATACAGCAATGATTCCTGGCGC 55 0 

lllllllllll Illlllll Illlllll I IN 1 1 1 II I II I ! 1 1 1 llllilllll 

AAGTTAAAAGTCGAGCAG(^TGTGGAGCTGTACCAGAAATA(^Gay^CAATTCCTGGCGA 8 0 0 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MM 1 1 1 1 1 r r 1 1 MM lllllllll 

TACCT(^G(^CCGGCTGCTGGCGCCC^GCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 86 0 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

Illllllllllllllllllll III 1 1 1 1 III II II Mill IMIIIII II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 



IMIMI MM Illllllllllllllllllll II Mill II llllilllll I 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 790 

I Illlllll MINIUM III! Mill llllll INI MM MINIM III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 



1 1 i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii mil i 1 1 1 : 1 1 1 1 1 1 1 , 1 1 1 1 1 



851 GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 910 

Illlllll Illllllllllllillll 
GCCCTGGACACCAACTACTGCTTCAGC 1121 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 970 

M 1 1 M II II II 1 1 II I II 1 1 II II II II 1 1 1 II lllllllllll Illlllll III 



Db 



1122 



TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 



1181 



Qy 971 TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 103 0 

M I II M I M I II 1 1 IIIMIIIMIMIIIMM MIIIIIIIIIMIMIIIII 

Db 1182 TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 1241 

Qy 1031 TACATCTGGAGCCTAGAC^CTC^GTACAGC^GGTCCTGGCTCTGTACAACCAGCACAAC 1090 

imii iiiiiiii mil 1 1 1 1 1 1 1 ! 1 1 1 : i ; i . 1 1 1 , iiiiiiiiiiiiii mi 

Db 1242 TACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAAC 13 01 

Qy 10 91 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1150 

mmm 1 1 1 1 1 1 r r 1 1 1 j r 1 1 1 f 1 1 1 r 1 1 1 1 1 f r 1 1 f i j 1 1 1 1 1 1 i i f 1 1 f 1 1 1 1 1 f 

Db 1302 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 13 61 

Qy 1151 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1210 

I II II 1 1 1 1 M 11 M M II 1 1 1 1 1 ! I M M M II 1 1 II 1 1 M 1 1 1 1 M 1 1 1 1 1 IMMI 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGC 1421 

Qy 1211 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 12 7 0 

II MMMMIMMMMMIMM II I II I I I I MM Ml I 

Db 14 22 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCGGCCCCGCCCCACC 14 81 
Qy 1271 CCCCGCCCGC CTCACCGGGGCTGTATTTAAGGACA-TCGTGCCCCAAGCCCAC 1322 

II I Mill I I Mlllllllllllllllll MIIMIIIIMIMI 

Db 14 82 CCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCAC 1541 

Qy 1323 TTGGGATCGATTAAA 1337 

MM I I II 
Db 1542 CTGGGGCCCCATTAA 15 56 



RESULT 12 
AAQ13392 

ID AAQ13392 standard; DNA; 1821 BP . 
XX 

AC AAQ13392; 
XX 

DT 20-NOV-1991 (first entry) 
XX 

DE Human pro -TGF -beta 1 gene. 
XX 

KW Osteogenetic; tumoricidal; ss. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 512 . . 1684 

FT /*tag= a 

FT sig_peptide 512.. 598 

FT /*tag= b 

FT misc_RNA 599.. 1684 

FT /*tag= c 

FT /note= "pro-TGF-beta 1" 

FT mat_peptide 1346.. 1684 

FT /*tag= e 

FT /note- "TGF-beta 1" 

XX 



PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 8 9 JP- 03 18243 . 
XX 

PR 07-DEC-1989; 89JP-0318243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 

DR P-PSDB; AAR13813. 
XX 

PT Human pro-TGF-beta 1 prodn., for osteo-genetic activity - by 

PT preparing DNA chain contg . base sequence coding for human 

PT pre :pro-TGF-beta 1, forming expression vector etc. 
XX 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The DNA sequence encodes human prepro-TGF-beta 1 which can be 

CC produced by recombinant methods, it has osteogenetic and 

CC tumoricidal activity. 
XX 

SQ Sequence 1821 BP; 326 A; 679 C; 508 G; 308 T; 0 other; 

Query Match 72.7%; Score 980.2; DB 12; Length 1821; 

Best Local Similarity 85.6%; Pred. No. 6.3e-191; 

Matches 1133; Conservative 0; Mismatches 143; Indels 47; Gaps 2; 

Qy 6 C CGAGATGG CG CCTTCGGGG CTG CGG CTCTTG CCG CTG CTG CTG C CG CTG CTGTGG CTG C 6 5 

II I I I I I I I II lllllllllll lllllllllllll llllllllllllll I 
Db 507 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I I M I I I I I II I I I I II lllllllll Mill llllllllllllll llllllllll 
Db 567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 626 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

'1 1 1 1 Mi I '1 1 1 1 1 1 1, 1 1 1 II II 1 1 1 1 1 lllllllllll lllllllllll MM 

Db 627 AG CTGGTGAAG CGGAAG CG CAT CGAGG CCATCCG CGG CCAGATC CTGT C CAAG CTG CGG C 686 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I ! 1 1 1 1 1 1 1 1 ! 1 1 I i 1 1 ! 1 1 1 i 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 Mill M II I 

Db 687 TGG C CAG CCCCCCGAGC CAGGGGG AGGTG CCGCCCGGCCCGCTGCC CGAGG C CG TG CT CG 74 6 

Qy 246 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II Mil llllllllllllll 1 1 1 1 1 1 1 1 MM M 1 1 1 1 1 1 1 1 1 II M I 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 3 06 CAGAGGCGGACTACTA CGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I Mill 1 1 II 1 1 M M 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 II I M I II 1 1 1 1 M 1 1 I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 66 

Qy 3 66 AAATCTATGATAAATTC^GGGCACCCCCC^C^GCTTATATATGCTGTTC^C^CGTCGG 425 

llllllllll M MIMI I I llllll MIMIM I IMIIIII II I 

Db 867 AAATCTATGACAAGTTCAAGCAGAGTACACA.C^ 926 



Qy 42 6 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

Illllll IIIIIIM II Mill M IIIIIIM 1 1 1 1 1 1 1 1 1 1 1 1 1 Illllll 

Db 92 7 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 986 

Qy 4 8 6 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I II I I I I I I II II II I I I I I I 

Db 98 7 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 104 6 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 Mill Mill II llllll I II MM 

Db 104 7 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1106 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III III III II II Mill Illllll 

Db 1107 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1166 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTGCACGTGGAAATTAACGGGT 725 



Db 1167 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 122 6 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III I I I I i I I I I I I N I I I I ! I M I I I I I I I I I I I I I I I I I I I I I I I Illllll 

Db 122 7 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 128 6 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I Mill I I I I I i I I I I I I I I I I I I 

Db 12 87 TTGTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 134 5 

Qy 84 6 CGCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

MM 1 1 1 1 1 1 1 1 MM 

Db 134 6 GCCCTGGACACCAACTATTGCT 1367 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 Illllll 

Db 1368 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1427 

Qy 966 TGGGCTGGAAGTGGATTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I Mil II II II M M ! 1 1 1 1 1 1 1 1 1 : M 1 1 IIIIIIM Illllll 

Db 142 8 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 8 7 

Qy 102 6 GTCCCTAC^TCTGGAGCCTAGA(^CTC^GTACAG(^GGTCCTGGCTCTGTAC^CCAGC 1085 

I IIIIIIM IIIIIIM Mill I ; 1 1 1 1 1 1 MM I hill 1 1 1 1 1 1 1 1 1 1 1 1 

Db 14 8 8 GCCCCTA(^TTTGGAGCCTGGA(^CGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1547 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I 1 1 1 1 1 1 M M I M I M I M M I II M 1 1 M M MM M 1 1 1 M M M I Illllll 

Db 154 8 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M I II 1 1 1 1 1 1 1 1 1 M 1 1 1 I 

Db 16 08 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 

Qy 12 06 CCTGC^GTGC^GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

I I 1 1 1 M I II I II 1 1 1 1 II I MUM II I II I I I Mill I 

Db 166 8 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 

Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 



Ill MM Ml II M [ 1 1 1 1 I [ 1 1 1 E 1 1 1 1 1 1 1 F MM I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 1787 

Qy 1318 CCC 1320 

II 

Db 1788 GCC 1790 



RESULT 13 


AAN81084 


ID 


AAN81084 standard; CDNA; 1560 BP. 


XX 




AC 


AAN8 1084 ; 


XX 




DT 


25-MAR-2003 (undated) 


DT 


09-OCT-1990 (first entrv) 


XX 




DE 


rod i na ^Prtnpnrp nf ^imi^n fran^f nrmi na arowth f r3r , r*nr-bet~; :: i 1 

H X 1 ^ \-J U^llU W i- iJ X III 1 LI QUO i- v> X, 1 1 1 X 1 ^-j X \-/ W L- J 1 X Gi V L» wX X_> ^ — L» Cx X . 


XX 




KW 


Trsnc? f ormi na arnwrh fa^^nr-hp^a 1 • fiimmir f rpafmpnr • pDMA 

X X C* i i J i- v_v X lllXlivj ^ X Vw» WL11 LlAL L LJ X X_> I, (X X | U Ull UX L J. V^C* L.1IIU11L ( u U \— * ii. • 


XX 




OS 


rprrnnithecu'? aethions 


XX 




FH 


Kpv J ,dc^t inn/On^il i "Pi pre: 

i vd y uu La l i w 1 1 / ^ Lid x. x x xv^-x, o 


FT 


CDS 261 14 33 


FT 


/*tag= a 


FT 


^ia npntidp 982 323 


FT 


/*tag= b 


FT 


Ilia L pcptlUC x. \J Z/ ~J • . X *± «j O 


FT 


/ Lay — l_ 


XX 




PN 


EP293785-A. 


XX 




PD 


07 -npr. i qqq 

LV / LJ Ht\^ J. ^ 0 0 . 


XX 




PF 




XX 




PR 


2 9 -MAY -198 7; 8 7US- 0055662 . 


PR 


25 -JAN- 1 98 8 ; 88US -0147842. 


XX 




PA 


(ONCO ) ONCOGEN. 


PA 


(BRIM ) BRISTOL-MYERS CO. 


XX 




PI 


Purchio AG, Gentry L, Twardzik D; 


XX 




DR 


WPI; 1988-347488/49. 


DR 


P-PSDB; AAP80647. 


XX 




FT 


Prodn. of simian transforming growth factor beta-1 - by culturing 


PT 


transfected eucaryotic cells, and new precursor proteins, useful for 


PT 


treating tumours. 


XX 




PS 


Disclosure; Page ?; pp; English. 


XX 




CC 


The cDNA is prepd . from African green monkey cell line BSC- 4 0 and is 


cc 


expressed in eukaryotic cells in plasmid pSV2 . There is 100% homology 


CC 


between mature simian and human TGF-beta 1. The plasmid also contains 



CC the SV4 0 promoter and a selection marker, esp. DHFR. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 72.6%; Score 979.6; DB 9; Length 1560; 

Best Local Similarity 85.6%; Pred . No. 8.2e-191; 

Matches 1144; Conservative 0; Mismatches 144; Indels 48; Gaps 3; 

ATGG CG C CTTCGGGG CTG CGG CT CTTG C CG CTG CTG CTG C CG CTG CTGTGG CTG CT AGTG 7 0 

III 1 1 1 1 II IIIMIMIII 1 1 1 1 1 1 1 1 1 III I IIIIIIIIIIIIII II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

MINIMI IMMMMMM Mill II I M M M I II 1 1 MMMM llllll 

CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 3 8 0 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

I M 1 1 1 1 1 1 M I II 1 1 II 1 1 MM IIIMIMIII MIMMIMI Mill III 

GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 4 4 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 25 0 

MIIIIIIMIIIIMIMI llllllllllllllllllll Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

IIIIIIII MIMIMIIMM IIIIIIII MM II MIMMIMI II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 5 60 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 70 

II M I h 1 1 1 III 1 1 1 1 1 M h M M 1 1 1 1 1 1 III II 1 1 1 1 1 1, I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 
TATGATAAATTC^GGGCACCCCCO^CAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 0 

Mill II llllll I I I llllll IIIIIIII I IIIIIIII II MUM 

TATGACAAGTTCAAGCAGAGCACACACAGCAT^ 6 8 0 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 90 

II Mill II II IIIIIIII MMMM 'IIIIIIIIIIIII MMIMIMM 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAAAAGTGGAGC^GCA.CGTGGAGCTATAC(^GAAATA(^Ga\ATGATTCCTGGCGC 550 

MIMMIMI MMMM MMMM I M 1 1 M 1 1 1 1 1 1 1 1 M IMMMMI 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACC^GAAATACAGC^a^TTCCTGGCGA 8 00 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 610 

I M LI 1 1 M M I II I II 1 1 1 1 1 Mill MM IIIIIIIII MM IIIIIIII 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 8 60 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 0 

1 1 1 1 1 1 1 M 1 1 1 III I M II , III MM III II II Mill IIIIIIII M 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 
GCCCACTCTTCCTCTGACAGCAAAGATAAC^ 73 0 

Illllll MM MIMMMMMMMMM II Mill II 'MINIMI I 

GCCCACTGCTCCTGTGACAGOWVGATAACACACTGC^GTGGAC^TCT^CGGGTTCAC^ 98 0 



Qy 


11 


Db 


261 


Qy 


71 


Db 


321 


Qy 


131 


Db 


381 


Qy 


191 


Db 


441 


Qy 


251 


Db 


501 


Qy 


311 


Db 


561 


Qy 


371 


Db 


621 


Qy 


A 1 1 

4 J 1 


Db 


681 


Qy 


491 


Db 


741 


Qy 


551 


Db 


801 


Qy 


611 


Db 


861 


Qy 


671 


Db 


921 



Qy 


731 


Db 


981 


Qy 


791 


Db 


1041 


Qy 


851 


Db 


1095 


Qy 


911 


Db 


1122 


Qy 


971 


Db 


1182 


Qy 


1031 


Db 


1242 


Qy 


1091 


Db 


1302 


Qy 


1151 


Db 


1362 


Qy 


1211 


Db 


1422 


Qy 


1271 


Db 


1482 


Qy 


1322 


Db 


1542 



I II MM MMMMIMMI MMI MMMMMMM MMMM Ml 



ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 5 0 

MMMMMMMMMMMMM 1 1 MMI II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 

GATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGC 910 

Illlllll i I I I I I I I I I I I I I I I I 

GCCCTGGACACCAACTACTGCTTCAGC 1121 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 970 

Mill I 1 1 1 1 ! 1 1 1 1 1 1 I E 1 1 1 ; i 1 1 1 1 1 II lllllllllll MMMM III 

TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGC 1181 
TGGAAGTGGATTCATGAA C C CAAGGG CTA CCATG CCAATTT CTG CCTGGGG CCCTGTC C C 103 0 

lllllllllll M II llllllllllllllllllll IIIIIIIIIIIIIIIIIIMI 

TGGAAGTGGATCCACGAGCC CAAGGG CTACCATGCCAACTT CTG CCTGGGG CCCTGTCCC 1241 
TACATCTGGAGCCTAGACACTC^GTACAGCAAGGTCCTGGCTCT 1090 

Mill MMMM Mill llllllllllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

TACATTTGGAGCCTGGACACGCAGTA(^GCAAGGTCCTGGCCCTGTA(^CCAGaVTAAC 13 01 
CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 115 0 

Illlllll ! 1 1 1 1 1 II I I MIL lllll 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 : 1 1 

CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1361 
TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1210 

II I II II II 1 1 II M 1 1 1 1 1 M II 1 1 1 II I II M M I II II I II II II II I M lllll 

TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 
AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 12 7 0 

II llllllllllllllllllllllll Illlllll Mill IIIIMI 

AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCCCGCCCCAC 14 81 



Ml III II I M Ml I MM II I IN 1 1 1 1 1 1 II! I II 1 1 M I 

CCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCG 

CTTGGGATCGATTAAA 133 7 
I MM I I II 



RESULT 14 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 
DT 09-JAN-2003 (updated) 
DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Betal. 



XX 

KW HIV; AIDS ; SIV; vaccine; AZT; CD4 ; cytokines; growth 
KW factors; ds . 
XX 

OS Cebus apella. 
XX 

FH Key 
FT CDS 
FT 

FT mat_peptide 
FT 
XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 
XX 

PF 25-AUG-1989; 
XX 

PR 25-AUG-1988; 
XX 

PA (ONCO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 
XX 

DR WPI; 1990-068723/10. 
DR P-PSDB; AAR05663 . 
XX 

PT Compsns . contg. transforming growth factor beta - 

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of synctia and 

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 
CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 



Location/Qual if iers 
267. .1437 
/*tag= a 
1103 . . 1437 
/*tag= b 



89EP-0115719 . 
88US-0236698 . 



Query Match 72.6%; Score 979.6; DB 11; Length 1560; 

Best Local Similarity 85.6%; Pred. No. 8.2e-191; 

Matches 1144; Conservative 0; Mismatches 144; Indels 48; Gaps 3; 
Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 70 

III 1 1 1 1 1 1 IIIIIMIIII : 1 1 1 1 1 M 1 1 1 1 1 llllllllllllll || Ml 

Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 

Qy 71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 0 

MINIMI IIIIMil IIIM I II 1 1 1 h 1 1! I II llllllll MMII 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 



Qy 131 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 190 

I II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 MMMMMI Mill Ml 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 

Qy 191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 25 0 



Db 


441 


Qy 


251 


Db 


501 


Qy 


311 


Db 


561 


Qy 


371 


Db 


621 


Qy 


431 


Db 


681 


Qy 


491 


Db 


741 


Qy 


551 


Db 


801 


Qy 


611 


Db 


861 


Qy 


671 


Db 


921 


Qy 


731 


Db 


981 


Qy 


791 


Db 


1041 


Qy 


851 


Db 


1095 


Qy 


911 


Db 


1122 


Qy 


971 


Db 


1182 


Qy 


1031 



1 1 i 1 1 1 1 II 1 1 1 1 II , I II llllllllll Illllll lllll II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 500 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

IIMIIII llllllllllllll IIMIIII 1 1 1 1 II llllllllll II Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCGCGAACCGGAG 560 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 37 0 

M 11111:11 lllllllll, IIMIIIMI lllllllll II; I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 430 

Mill II llllll I I I llllll MIIMII I llllllli II llllll 

TATGACT^GTTCAAGCAGAGC^CACACAGCATA 68 0 

CGGGAAG CGGTGC CGGAACCTGTATTGCTCT CT CGGG CAGAGCTG CG CCTG CTGAGG CT C 4 90 

II lllll II II IIMIIII IIMIIII llllllllllllll I III M 1 1 1 1 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 550 

Milium: mm mini mimimmmmim imiiimm 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGGGA 8 0 0 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCAGCGGAGTGGCTGTCCTTTGATGTC 610 

nmimi im.mi miii mm imiiimi mm imiiimi 

TA CCT CAG CAA C CGG GTG CTGG CG C C CAG CAACT CG CCGGAGTGGTTGT CTTTTGATGT C 8 60 



Illllll II IIIIMIII III MM Ml II II Mill MIIMII I 



GCCC^CTCTTCCTCTGAC^GC^AAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 73 0 

Illllll MM II M lllll II llllllllll I 

GCCCACTGCTCCTGTGACAGCAAAGATAACAG&CT 98 0 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 9 0 

I IIMIIII MIMIIMMIM lllll MIMMMMIM MIIMII III 

AC CGG CCGCCGAGGTGAC CTGG CCA.CAATTCATGGCATGAAC CGG CCTTT CCTG CTTCTC 104 0 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 50 

m ii 1 1 m m i ii 1 1 1 1 1 1 1 1 1 1 m mil m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGA 1094 



MIIMII 1 1 1 1 M I M 1 1 Mil . I 

-GCCCTGGACACCAACTACTGCTTCAGC 1121 



TCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGC 970 

i ii 'i 1 1 1 1 ii 1 1 1 1 1 ii mi 1 1 1 1 1 ii 1 1 1 ii milium miiimi mi 

T C CA CGG AG AAGAA CTG CTG CGTG CGG CAG CTGTATATTG A CTT C CG CAAGGA C CT CGG C 1181 
TGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCC 103 0 

1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 MMMMMMMMMMI 

TGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCC 1241 
TACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGC^ 1090 

Mill MIIMII Mill I II 1 1 1 1 ' I lllllllll . M II 1 1 1 M I II 1 1 Ml 



Db 



124 2 TAC^TTTGGAGCCTGGACACGCAGTAGAGCAAGGTCC^ 13 01 



Qy 1091 CCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1150 

Illlllll 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 , 1 1 : II M II I I I I I I I I I I I I I ; I II I I MM 

Db 13 02 CCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTG 1361 

Qy 1151 TACTACGTGGGGCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGC 1210 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 

Db 13 62 TACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGA 1421 

Qy 1211 AAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCAC 1270 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I II I I I Mill lllllll 

Db 1422 AAATGCAGCTGAGGCCCCGCCCCGCCCCGCCCCACCCCGGCAGGCCCGGCCGCGCCCCAC 14 81 

Qy 12 71 CCCCGCCGGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAAGCCCA 1321 

III Ml M I I I I I I I I M I I I I I I I I I I lllllllllllllll 

Db 14 82 GCCACCCCCGCTGTCTTGCCCTTGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGCCCA 1541 

Qy 1322 CTTGGGAT CGATTAAA 1337 

I MM I I II 
Db 1542 CCTGGGGCCCCATTAA 1557 

RESULT 15 
AAT05876 

ID AAT05876 Standard; CDNA; 2745 BP . 
XX 

AC AAT05876; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE cDNA encoding transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-l-beta ; transforming growth factor-beta; TGF-beta; ILl-beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment; ds . 

XX 

OS Mammalian sp. 
XX 

FH Key Location/Qualifiers 
FT CDS 842.. 2017 

FT /*tag= a 

FT /product^ transforming growth factor-beta 1 

XX 

PN W09526745-A1 . 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03705 . 
XX 

PR 05-APR-1994; 94WO-US03 7 05 . 
XX 

PA (HARD ) HARVARD COLLEGE. 
XX 

PI Lee M, Perrella MA; 
XX 



DR WPI; 1995-358443/46. 

DR P-PSDB; AAR83054. 
XX 

PT Treatment of hypotension, esp. in septic shock - by administering 

PT transforming growth factor-beta e.g. to inhibit inducible nitric 

PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 15; 52pp; English. 
XX 

CC The cDNA encodes transforming growth factor-beta 1 (TGF-beta 1) which 

CC has been found to inhibit inducible nitric oxide synthase (iNOS) gene 

CC transcription, esp. in interleukin- 1 -beta (ILl-beta) stimulated rat 

CC smooth muscle cells, and at a dose which does not inhibit constitutive 

CC NOS. TGF-beta 1 or 2 (AAR83055) or their active fragments, can be used 

CC in the treatment of hypotension, such as that associated with severe 

CC inflammation or septic shock. 
XX 

SQ Sequence 2745 BP; 527 A; 938 C; 801 G; 479 T; 0 other; 

Query Match 72.5%; Score 977.8; DB 16; Length 2745; 

Best Local Similarity 85.3%; Pred . No. 2.1e-190; 

Matches 1146; Conservative 0; Mismatches 147; Indels 50; Gaps 3; 
Qy 6 C CGAGATGG CG C CTT CGGGG CTG CGG CTCTTG C CG CTG CTG CTGC CG CTG CTGTGG CTGC 65 

II III 1 1 1 I II II MUM I I , I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 'IN MINIMI Mill 1 1 1 1 1 1 1 1 1 II 1 1 r r 1 1 1 1 J 1 1 1 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 95 6 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 II 1 1 1 1 II 1 1 I MM Mill 1 1 1 1 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I I I I I I I II I I I I I II I I II I I I I I I I I J I I 1 i I I I I 1 J I I J I Mill II II I 
Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M Ml MM I II I II II I II 1 1 1 M 

Db 1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 113 6 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill IIIIIMIIMMIII MMMMIMIMIMIMM II I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 366 AAATCTATGATAAATTC^GGGC^CCCCCC^CAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MIIMIMI II MUM I I MIMI Mill I Mill II I 

Db 1197 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCAT^ 12 56 

Qy 42 6 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT - - 483 

MM 1 1 1 1 1 1 1 1 II Mill II M MM I [ 1 1 1 1 1 1 1 1 1 1 1 ! Mill 

Db 1257 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 

Qy 4 84 -GAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGA^ 54 2 

I I I M I II II I II I I II I II II II II II I I I M M I MIMMMIMMIM III 



Db 1317 GGAGGCTCAAGTTAAAAGTGGAGCAGC^ 1376 

Qy 54 3 CCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 602 

lllllll II IIIIIMIII Mill! I Mill Mill II MUM I II I 

Db 1377 CCTGG CGATA CCTCAG CAA CCGG CTG CTGG CAC CCAG CGA CT CG CCAGAGTGGTTAT CTT 143 6 

Qy 603 TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 662 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 [ 1 1 1 1 Ml Ml Ml M II Mill MM 

Db 143 7 TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 14 96 

Qy 663 GCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACG 722 

MM M lllllll MM llllllll lllllllllll II Mill II MM 

Db 14 97 GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACG 1556 

Qy 723 GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 782 

Illlil I I llllllll lllllllllllllllllill lillllllllllll MM 

Db 1557 GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 1616 

Qy 783 TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 842 

MM MMMMIIMIMMIMMMMMIMI Mill II 1 1 1 1 1 1 II 1 1 1 1 1 1 

Db 1617 TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 1676 

Qy 843 GAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACT 9 02 

II llllllll llllllll I 

Db 167 7 GA GCCCTGGACACCAACTATT 1697 

Qy 903 GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 962 

Db 16 98 GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGG 1757 

Qy 963 ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 1022 

MM 1 : 1 1 1 1 1 ! 1 1 ! 1 II II lllllllllllllllllill llllllll MM 

Db 1758 ACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGC 1817 

Qy 1023 CCTGTCCCTA(^TCTGGAGCCTAGA(^CT(^GTACAGC^GGTCCTGGCTCTGTACAACC 1082 

MM llllllll llllllll Mill lllllllllll MM'!. MIIIIIMI 

Db 1818 CCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC 1877 

Qy 1083 AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 114 2 

MM lllllllllll III 1 1 II 1 1 1 1 1 1 1 1 1 1 1 Ml I Ml I II M 1 1 1 1 1 Ml MM 

Db 18 78 AG CATAACC CGGG CGC CT CGG CGG CG C CGTG CTG CGTG CCGCAGG CG CTGG AG CCG CTG C 193 7 

Qy 1143 CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 12 02 

Ml Illlil Illlil J 1 1 1 1 1 i 1 1 M 1 1 1 I I i ! 1 1 1 . [ I i I 

Db 193 8 CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 1997 

Qy 12 03 GTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCC 1262 

I 1 1 1 M II 1 1 Ml II I II 1 1 1 lllllllllll llllllll MM 

Db 19 98 GCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCC 2 057 

Qy 1263 GGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCC 1314 

MM Ml III II II 1 1 1 1 1 llllllll 

Db 2 058 CACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCC 2117 



Qy 1315 AAGCCCACTTGGGATCGATTAAA 133 7 

llllllll MM I I II 

Db 2118 AAGCCCACCTGGGGCCCCATTAA 214 0 



Search completed: October 27, 2003, 19:11:13 
Job time ; 415.282 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



October 27, 2003, 18:35:27 ; Search time 400.963 Seconds 

(without alignments) 
9022.658 Million cell updates/sec 

US-10-017-372E-32 
1349 

1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 1349 



Scoring table: I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



1792395 seqs, 1340900451 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



3584790 



Post-processing : 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Publ ished_Appl icat ions_NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB. seq: * 

2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB. seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB . seq : * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq : * 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB. seq: 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB. seq: * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB . seq : * 

9 : /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: 

10: / cgn2_6 /pt oda ta 1 2 /pubpna /US0 9B_PUBCOMB . seq 

11: / cgn2__6 /p t oda t a / 2 /pubpna /US 0 9 C_PUBCOMB . s eq 

12: / cgn2_6 /p t oda t a / 2 /pubpna / US 0 9__NEW_PUB . s eq : 

13: / cgn2_6 /pt oda t a / 2 /pubpna /US 1 0A_PUBCOMB . s eq 

14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq 

15 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB . seq: * 

16: / cgn2_6 / pt: oda t a / 2 /pubpna /US6 0_NEW_PUB . s eq : * 

17 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB . seq : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



: * 
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-09 


-833 


-381-1416 


Sequence 


1416, Ap 




3 1 


104 


. 8 


7 


. 8 


851 


13 


us 


-10 


-027 


-632-152938 


Sequence 


152938 , 




32 


102 


.2 


7 


. 6 


29000 


11 


us 


-09 


-906 


-158-17 


Sequence 


17, Appl 


c 


33 


100 


.4 


7 


. 4 


224 


10 


us 


-09 


-833 


-381-73 


Sequence 


73, Appl 




34 


79 


. 4 


5 


. 9 


450 


11 


us 


-09 


-918 


-995-6145 


Sequence 


6145, Ap 


c 


35 


69 


. 8 


5 


.2 


431 


9 


us- 


09- 


864- 


761-18644 


Sequence 


18644, A 


c 


36 


69 


.4 


5 


. 1 


176 


10 


us 


-09 


-833 


-381-571 


Sequence 


571, App 




37 


69 


5 


. 1 


584 


12 


us 


-10 


-029 


-386-9758 


Sequence 


9758, Ap 




38 


68 


.2 


5 


. 1 


176 


12 


us 


-10 


-029 


-386-23458 


Sequence 


23458, A 


c 


39 


64 


4 


.7 


363 


10 


us 


-09 


-833 


-381-585 


Sequence 


585, App 




40 


62 


. 6 


4 


. 6 


364 


12 


us 


-10 


-029 


-386-26476 


Sequence 


26476, A 




41 


62 


. 6 


4 


.6 


544 


12 


us 


-10 


-029 


-386-12776 


Sequence 


12776, A 




42 


62 


.6 


4 


. 6 


927 


12 


us 


-10 


-244 


-718-1 


Sequence 


1, Appli 




43 


57 


. 4 


4 


. 3 


486 


11 


us 


-09 


-918 


-995-25641 


Sequence 


25641, A 




44 


55 


.2 


4 


. 1 


594 


13 


us 


-10 


-027 


-632-141376 


Sequence 


141376, 




45 


53 


.2 


3 


. 9 


658 


9 


us- 


09- 


765- 


527-254 


Sequence : 


2 54, App 



ALIGNMENTS 



RESULT 1 
US-10-087-268-1 

; Sequence 1, Application US/10087268 



Publication No. US20030119010A1 
GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER : US/ 10/ 08 7 , 268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 1 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 
NAME/ KEY: 5 1 UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME/ KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY: sig_peptide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME/ KEY: 3 ' UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-1 

Query Match 72.8%; Score 981.8; DB 14; Length 1821; 

Best Local Similarity 85.7%; Pred. No. 3.6e-254; 

Matches 1134; Conservative 0; Mismatches 142; Indels 47; Gaps 2; 

Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III MM II lllllllllll lllllllllllll llllllllllllll I 
Db 5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 566 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I 1 1 1 1! I M i I M 1 1 1 1 1 MM; 1 1 Mill llllllllllllll IIIIMIIII 

Db 567 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 626 

Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

IIMIIII IIMMMII MM MIMI lllllllllll lllllllllll MM 

Db 627 AG CTGGTGAAG CGG AAG CG CAT CGAGG C CAT C CG CGG C CAG AT C C TGT C CAAG CTG CGG C 68 6 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I : I I ! I I I I I I I I I I I I I I I I I llllllllllllllllllll Illll II II I 
Db 687 TCG C CAG C CC CC CGAG C CAGGGGGAGGTG CCG C C CGG C C CG CTG C C CGAGG C CGTGCTCG 74 6 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II llllllll llllllllllllll IIMIIII MM llllllllllllllll 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCAC CCG CGTGCTAATGGTGGAAAG CGG CAACC 3 65 

I Illll MIMMMM MM MMMMMMMMMIIMMII I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCG CGTGCTAATGGTGGAAACCCACAA CG 8 66 



Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 4 25 

Milium ii mini i i Mm i iiiiiiii i iiiiiiii ii i 

Db 8 67 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 92 6 

Qy 42 6 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

1 1 1 1 1 1 1 IIIIIIII II Mill M IIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM i 

Db 92 7 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 986 

Qy 48 6 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 IMIII 

Db 987 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 104 6 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

1 1 1 1 III Mill III I II MM Mill Mill Mill II IMIII I II MM 

Db 104 7 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1106 

Qy 606 ATGTCA C CGGAGTTGTG CGG CAGTGG CTGA C C CG CAG AGAGG CTAT AG AGGGTTTTCG C C 665 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] I III III III II II Mill 1 1 1 1 1 1 1 

Db 1107 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1166 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAAGGGGT 72 5 

I II Ml MM MMMM 1 1 1 1 1 1 1 1 1 1 1 II Mill II 1 1 1 1 1 

Db 1167 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 122 6 

Qy 726 TCAATTCTGGCGGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 78 5 

III I I IIIIIIII 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 IMIII 

Db 1227 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 128 6 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I i I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill M II 1 1 II I M Ml 1 1 M 

Db 1287 TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA- 134 5 

Qy 84 6 CGCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

IIIIIMI IIIIIMI MM 
Db 134 6 GCCCTGGACACCAACTATTGCT 1367 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 1 1 1 1 1 f 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM II II 1 1 1 

Db 1368 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1427 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I MM MMMM II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIII IMIII 

Db 1428 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 14 8 7 

Qy 1026 GTCCCTAC^TCTGGAGCCTAGAC^CTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

I IIIIIIII MMMM Mill 1 1 1 1 1 1 1 : 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 148 8 GCCCCTACATTTGGAGCCTGGA(^CG(^GTA(^GCAAGGTCCTGGCCCTGTAC^ACCAGC 154 7 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCA.CTGCCCA 114 5 

I I I I I I I I 1 I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1548 ATAA C C CGGGCG C CTCGG CGG CG CCGTG CTGCGTG C CGCAGG CG CTGGAG C CG CTG CC CA 1607 

Qy 114 6 TCGTGTACTACGTGGGCCGC^GCCC^GGTGGAGC^GCTGTCa^CATGATCGTGCGTT 12 05 

I I M 11 1 1 1 1 1 1 i 1 1 1 1 1 M : 1 1 ! 1 1 1 1 1 1 1 ; I ! Ml 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 1608 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 



Qy 12 06 CCTGO^GTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

M M M 1 1 . 1 II 1 1 1 1 1 llllllllllll II I II I I I Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

Ml MM Ml II II Illllllllllllllllil llllllll I 

Db 172 8 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCGCAA 1787 

Qy 1318 CCC 1320 

II 

Db 1788 GCC 1790 



RESULT 2 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION: 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao , Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J. 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou , Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: Tillinghast, John 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: No. US20030104529Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/ 1 0/ 037 , 27 0 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER : 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 22 0 
LENGTH: 2742 
TYPE : DNA 

ORGANISM: Homo sapiens 



FEATURE : 
NAME/ KEY: 
LOCATION: 



CDS 
(842) 



(2014) 



US-10-037-270-220 



Query Match 72.8%; Score 981.4; DB 14; Length 2742; 

Best Local Similarity 85.5%; Pred. No. 5.1e-254; 



Matches 


1147; Conservative 0; Mismatches 146; Indels 48; Gaps 


3; 


Qy 


6 


CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 

II MINIMI 1 MIIIIIIIMII llllllllllllll 1 

CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 


65 


Db 


837 


896 


Qy 


66 


TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllll 1 II 1 1 1 1 1 1 1 

TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 


125 


Db 


897 


956 


Qy 


126 


AG CTGGTGAAG CGGAAG CG CAT CGAGGCCATTCG CGG C CAGATT CTGTC CAAG CTT CGG C 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AG CTGGTGAAG CGGAAG CG CAT CGAGGCCAT CCG CGG C CAGAT C CTGTC CAAG CTG CGG C 


185 


Db 


957 


1016 


Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


1017 


1076 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 

1 II Mil III llllllllllllll MINIM MM MIIIIIIIMII' 

CC CTGTACAACAG CAC C CG CGACCGGGTGGC CGGGG AGAGTG CAGAACCGGAG C C CGAG C 


305 


Db 


1077 


1136 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II M 1 1 1 II II 1 1 1 1 1 
CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


365 


Db 


1137 


1196 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

1 1 1 1 1 II 1 II II 1 1 1 II 1 1 1 WWW II II 1 II 1 1 II 1 II 1 II II 1 
AAATCTATGACAAGTTCAAGCAGAGTACACA 


425 


Db 


1197 


1256 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II II 1 II 1 1 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


1257 


1316 


Qy 


486 


GGCTC^GTTAAAAGTGGAGCAG^CGTGGAGCTATACCAGAAATACAGO^ATGATTCCT 

II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 II 1 1 
GGCTC^GTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGC^CAATTCCT 


545 


Db 


1317 


1376 


Qy 
Db 


546 
1377 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 

1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II INI 
GG CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


605 
1436 


Qy 


606 


ATGTCACCGGAGTTGTG CGG CAGTGG CTGAC C CG CAGAGAGG CTATAGAGGGTTTTCG CC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 III III III II II 1 1 1 1 1 1 1 1 1 1 1 1 
ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


665 


Db 


1437 


1496 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 
i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii i i i < i ii i i i i i i i 

1 II 1 II II 1 1 1 II 1 N 1 1 II II 1 1 II 1 1 1 1 II 1 II 1 II 11 II 1 1 1 1 1 M 

TTAUCUCCCAUTGCrCLTGTGACAGCAGG 


725 


DO 


i4yv 


1556 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 

Mill MM III 1 1 1 1 ! 1 1 1 1 ! Mill 1 ! 1 1 llllllllllllll Mill, 

TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 


785 


Db 


1557 


1616 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 1 M 1 ' 1 ! 1 1 1 1 1 1 1 1 , 1 (1 1 1 1 1 1 1 1 1 UNI MINNNMIININ 

TT CTCATGGCCACCCCG CTGGAGAGGG CCCA.G CAT CTG CAAAGCTCC CGG CAC CGCCGA- 


845 


Db 


1617 


1675 



Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 9 05 

I III Ml I II I II II I MM 

Db 1676 GCCCTGGACACCAACTATTGCT 1697 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

II 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II I 

Db 16 98 TCAG CT C CACGG AGAAG AA CTG CTG CG TG CGG CAG CTGTACATTGA CTT C CG CAAGGACC 1757 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 1025 

I Mill I II III III II M III I III I II I Mill III I Mill III III MM 

Db 1758 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1817 

Qy 1026 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

I I II II II I I MM I II Mill III Ml II Mill III III I III 1 1 II II MM 

Db 1818 GCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGC 1877 

Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM I: 

Db 18 78 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1937 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 12 05 

1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 I 

Db 1938 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 19 97 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 Mill I II Mill I 

Db 1998 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCGCCAC 2 057 
Qy 1266 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAA 1316 

III MM III II II I M 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 III III II 

Db 2058 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAA 2117 

Qy 1317 GCCCACTTGGGATCGATTAAA 133 7 

MUM Mill I II 

Db 2118 GCCCACCTGGGGCCCCATTAA 2138 



RESULT 3 
US-10-087-268-4 

Sequence 4, Application US/10087268 
Publication No. US2 003 01 1 9010A1 
GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER: US/10/087,268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 4 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 



NAME /KEY: 5 ' UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME/ KEY : CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME / KEY : s i g__p ep 1 1 de 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME/ KEY: 3 1 UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-4 

Query Match 72.7%; Score 980.2; DB 14; Length 1821; 

Best Local Similarity 85.6%; Pred. No. 9.8e-254; 

Matches 1133; Conservative 0; Mismatches 143; Indels 47; Gaps 2; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II Ml 1 1 1 1 I! MM Mill III MM 1 1 1 1 1 1 1 1 1 1 1 E 1 1 I 

Db 5 07 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 5 66 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIMI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 567 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 62 6 

Qy 12 6 AG CTGGTGAAG CGGAAG CG CAT CGAGG C CATT CG CGG C CAGATT CTGT C CAAGCTTCGG C 185 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Mill ; 1 1 ! 1 1 1 1 1 ' I II I 

Db 627 AG CTGGTGAAG CGGAAG CG CAT CGAGGC CATC CG CGG CCAGATC CTGT CCAAGCTG CGG C 686 

Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill II II I 
Db 687 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 74 6 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I M I Ml 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 74 7 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 8 06 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I Mill I E 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 r I MM 

Db 8 07 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 8 66 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 4 25 

1 1 1 1 1 1 1 1 1 1 II MUM I I MINI MINIM I III MM II I 

Db 867 AAATCTATGACAAGTTCAAGCAGAGTACACAC 92 6 

Qy 4 26 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

MINN i Mill II Mill II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll 

Db 927 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 98 6 

Qy 48 6 GGCT(^^GTTAAAAGTGGAGC^G(^CGTGGAGCTATACCAGAAATAC^GCAATGATTCCT 54 5 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 

Db 987 GGCTC^GTTAAAAGTGGAGC^GC^CGTGGAGCTGTACCAGAAATAC^GCAACAATTCCT 104 6 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 6 05 

1 1 1 1 II Mill Mill II MM I II MM 

Db 104 7 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 1106 



Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

MM llllll I II I II llllll MM III 111 Ml II II Mill IMMII 

Db 1107 ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 1166 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

I II IMMII MM llllllll MINIMI ! I II Mill II IMMII 

Db 1167 TTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGT 122 6 

Qy 726 TCAATT CTGG C CG C CGGGGTGAC CTGGCCACCATTCACGG CATGAA C CGG CC CTT CCTG C 785 

III I I I : i I ; 1 1 1 IIIIMIIMMI llllll IIIIMMMMII IMMII 

Db 1227 TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGC 12 8 6 

Qy 786 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

I I i 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 ; 1 1 : 1 I I Mill IMIIIIIIIII II M 

Db 12 87 TTCTCATGGCCAC CC CG CTGGAGAGGG C CCAG CAT CTG CAAAG CTC C CGG CA C CG C CGA - 134 5 

Qy 84 6 CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

llllllll llllllll MM 
Db 1346 GCCCTGGACACCAACTATTGCT 1367 

Qy 9 06 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

1 1 1 1 1 i 1 1 1 M II 1 1 1 1 1 M I II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1: 1 llllll! 

Db 13 68 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACC 1427 

Qy 966 TGGGCTGGAAGTGGATT(^TGAACC(^GGGCTAC(^TGC(^ATTTCTGCCTGGGGCCCT 102 5 

I llllll II II I! II Ml 1 1 1 1 1 1 1 1 Ml! llllllll IMMII 

Db 142 8 TCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCT 1487 

Qy 1026 GTCCCTACATCTGGAGCCTAGACACT(^GTA(^GC^GGTCCTGGCTCTGTACAACCAGC 1085 

I llllllll llllllll Mill I III I I I I M I I I III I I I I MINI II III 
Db 14 88 GCCCCTAC^TTTGGAGCCTGGAC^CGC^GTAC^GC^AGGTCCTGGCCCTGTACAACCAGC 1547 

Qy 1086 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 

I I llllllll' 1 1 III I II Mi 1 1 Ml 1 1 II II I II IN I II M 1 1 1 i IMMII 

Db 154 8 ATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCA 1607 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA.GCTGTCCAACATGATCGTGCGTT 1205 

III IMMIII IIIIIIIMMIII II IMMII IMMIIIMI IMMII I 

Db 1608 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCT 1667 

Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

II llllllll !h III I I. llllllll Mill I 

Db 1668 CCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCAC 1727 

Qy 12 66 CCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAG 1317 

III MM III II II ! M 1 1 1 1 M I MM 1 1 II I llllllll I 

Db 1728 CCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCCCAA 1787 

Qy 1318 CCC 1320 

II 

Db 1788 GCC 1790 



RESULT 4 

US-09-948-002-28 

; Sequence 28, Application US/09948002 



Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F . Murray 

TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/ 09/ 94 8 , 002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER: 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 28 
LENGTH: 2 745 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (842) . . . (2017) 
US-09-948-002-28 

Query Match 72.5%; Score 977.8; DB 11; Length 2745; 

Best Local Similarity 85.3%; Pred . No. 4.8e-253; 

Matches 1146; Conservative 0; Mismatches 147; Indels 50; Gaps 3; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

II III 1 1 1 1 1 1 IMIMIIIII .111111, II MIL II! I 

Db 837 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 96 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I 1 1 1 1 1 1 ! 1 1 1 1 I . I MINIMI Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 8 97 TGGTGCTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 956 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 M I MM 

Db 957 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 1016 

Qy 186 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMM II II I 

Db 1017 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 1076 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II IMMIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1077 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 1136 

Qy 3 06 CAGAGG CGGA CTA CTA CG C CAAGG AGGT CAC CCG CGTG CTAATGGTGGAAAG CGG CAA CC 365 

I Mill 1 1 ! 1 1 1 1 I 1 1 1 1 1 1 [! I I 1 1 1 1 1 I 1 1 1 1 1 1 1 M I M I M 1 1 1 I I MM 

Db 1137 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 1196 

Qy 3 66 AAATCTATGATAAATTC^GGGCACCCCCCAC^GCTTATATATGCTGTTO^ACACGTCGG 425 

MMIIIMI II llllll I I MUM MMMM I MMMM II I 

Db 1197 AAATCTATGAQVAGTTCAAGCAGAGTACACACAGCAT^ 12 56 

Qy 4 26 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT - - 4 83 

Mill MMMM II Mill II MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 



Db 



1257 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 1316 



Qy 


484 


-GAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATT 


542 


Db 


1317 


IMIIIIMIIMMMIIIMIIMMIIIMIII II 1 1 1 1 1 1 1 II 1 1 1 M 1 III 

GGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATT 


1376 


Qy 


543 


CCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCT 

Illllll IMIIIMMMIIIIIIIMM IIIII Mill II llllll 1 II 1 

CCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTT 


602 


Db 


1377 


1436 


Qy 


603 


TTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTC 

IIIIMIIMIIMIIIMIIIIIIIMI Ml III III II 1 1 Mill MM 

TTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTC 


662 


Db 


1437 


1496 


Qy 


663 


GCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACG 

MM M MMIM MM MMMM MIMMMM II Mill M MM 

GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACG 


722 


Db 


1497 


1556 


Qy 


723 


GGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCC 

llllll 1 1 llllllll MIMIMMIIIMIIMI IMIIIIIIMIM MM 

GGTTCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCC 


782 


Db 


1557 


1616 


Qy 


783 


TGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCC 

MM MMMMMMMMMMMMMMMM Mill IIIIMIIMIIIMI 

TGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCC 


842 


Db 


1617 


1676 


Qy 


843 


GAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACT 


902 


Db 


1677 


M MMMM MMMM 1 

GA GCCCTGGACACCAACTATT 


1697 


Qy 
Db 


903 
1698 


GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGG 

lllllllll lllllllllllllllllllllllllllllll llllllllllllll MM 

GCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGG 


962 
1757 


Qy 

Db 


963 
1758 


ACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGC 

MM llllllllllllll II II llllllllllllllllllll llllllll MM 

ACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGC 


1022 
1817 


Qy 


1023 


CCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACC 

MM MMMM MMMM IIIII 1 1 1 II 1 1 1 II 1 II II II 1 II llllllllll 

CCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACC 


1082 


Db 


1818 


1877 


Qy 


1083 


AGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGC 

MM MIMMMM II II II 1 II 1 II II II 1 1 1 1 II II 1 II 1 1 II II 1 1 1 II MM 

AGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGC 


1142 


Db 


1878 


1937 






LUA. 1 LLj ILxlAL 1 ALU i UuCjLLbLAA.OLLLAAub I bOAGCAGCTOTCCAACATGATCGTGC 

II II II II 1 II II 1 II II II II II 1 1 1 II II M 1 1 II 1 1 II II II II II II II 1 Ml ! M 

CCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGC 


1202 


Db 


1938 


1997 


Qy 


1203 


GTTCCTGO^GTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCC 

1 II II II 1 1 II II II II II M Ml || | || | | | MMI 

GCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCC 


1262 


Db 


1998 


2057 


Qy 


1263 


GGCCCCACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCC 


1314 


Db 


2058 


MM MM III II M 1 II 1 1 1 1 II 1 1 M 1 1 1 1 1 1 MMMM 

CACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCGTGCCCC 


2117 



Qy 



Db 



1315 AAGCCCACTTGGGATCGATTAAA 13 3 7 

llllllll I II I I I II 
2118 AAGCCCACCTGGGGCCCCATTAA 214 0 



RESULT 5 
US-09-948-002-1 

; Sequence 1, Application US/09948002 
; Publication No. US20030050265A1 
; GENERAL INFORMATION: 
; APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F. Murray 
; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE : ISPH-0607 

; CURRENT APPLICATION NUMBER: US/09/948 , 002 
; CURRENT FILING DATE: 2000-09-05 

PRIOR APPLICATION NUMBER: 09/661,753 
; PRIOR FILING DATE: 2000-09-14 
; PRIOR APPLICATION NUMBER: 60/154,546 
; PRIOR FILING DATE: 1999-09-17 
; NUMBER OF SEQ ID NOS : 71 
; SEQ ID NO 1 

LENGTH: 2 094 

TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (868) . . . (2040) 
US-09-948-002-1 

Query Match 63.7%; Score 859; DB 11; Length 2 094; 

Best Local Similarity 81.5%; Pred. No. 4.1e-221; 

Matches 1038; Conservative 0; Mismatches 190; Indels 45; Gaps 2; 
Qy 6 CCGAG ATGG CG C CTT CGGGG CTG CGG CTCTTG C CGCTG CTG CTG C CG CTG CTGTGG CTG C 65 



Db 




Qy 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 



Db 




Qy 



12 6 AG CTGGTGAAG CGGAAG CG CAT CGAGG C CATTCG CGG C CAGATT CTGT C CAAG CTTCGG C 18 5 



Db 




18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 2 4 5 



Db 



1043 TCGCCAGTCCCCCAAGCCAGGGGGAGGTACCGCCCGGCCCGCTGCCCGAGGCGGTGCTCG 1102 



Qy 



24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 



Db 




Qy 



3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 



I II lllllllllll 1 1 II IMMIIMIIIMIIIIIMMMI II MM 

63 CCGAAGCGGACTACTATGCTAAAGAGGTCACCCGCGTGCTAATGGTGGACCGCAACAAC 



Db 


1163 


CCGAAGCGGACTACTATGCTAAAGAGGTCACCCGCGTGCTAATGGTGGACCGCAACAACG 


1222 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

lllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CC^TCTATGAGAAAACCT^AAGACATCTCACA.CAGTATATATATGTTCTTCAATACGTCAG 


425 


Db 


1223 


1282 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 


485 


Db 


1283 


1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 
ACATTCGGGAAGCAGTGCCCGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCTTGCAGA 


1342 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 


Db 


1343 


1 Ml Mi IIIIIIM M Mill M MMMMMI Mill MMM 

GATTAAAATCAAGTGTGGAGCAACATGTGGAACTCTACCAGAAATATAGCAACAATTCCT 


1402 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


1403 


lllllllll 1 M 1 1 1 II 1 1 II 1 1 II II II 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 
GGCGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 


1462 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


1463 


1 1 1 1 1 1 II 1 1 1 1 1 II II 1 M 1 1 1 II 1 II 1 Mill MMMMMM 

ACGTCACTGGAGTTGTACGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGAT 


1522 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCAGGTGGAAATTAACGGGT 


725 


Db 


1523 


MM M MM M 1 1 II II M 1 1 M II II II M M M M M M II 1 MMM 

TCAGCGCTCACTGCTCTTGTGACAGCAAAGATAACAAACTCCACGTGGAAATCAACGGGA 


1582 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 


785 


Db 


1583 


Ml 1 II Mill II II 1 1 1 1 1 II 1 1 II 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 1 
TCAGCCCCAAACGTCGGGGCGACCTGGGCACCATCCATGACATGAACCGGCCCTTCCTGC 


1642 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACGGCCGAG 

1 1 1 M 1 1 1 II II 1 1 II Mill II 1 M 1 1 II 1 1 II II 1 II 1 1 1 1 1 1 1 M 1 1 1 1 II 
TCCTCATGGCCACCCCCCTGGAAAGGGCCCAGCACCTGCACAGCTCACGGCACCGGAGA- 


845 


Db 


1643 


1701 


Qy 


846 


CCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 


905 


Db 


1702 


1 II 1 1 II II 1 M II II 1 MM 
GCCCTGGATACCAACTATTGCT 


1723 


Qy 


906 


TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 

1 1 1 1 M 1 1 II 1 1 M 1 II II 1 1 1 II II 1 II 1 1 1 II 1 1 1 II 1 M 1 1 1 1 II 1 II 1 1 1 1 
TCAGCTCCACAGAGAAGAACTGCTGTGTGCGGCAGCTGTACATTGACTTTAGGAAGGACC 


965 


Db 


1724 


1783 


Qy 


966 


TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 

MM MMMMMI II II II 1 II 1 1 1 1 M II II 1 1 II 1 MM II III 1 1 1 1 

TGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGACCCT 


1025 


Db 


1784 


1843 


Qy 


1026 


GTCCCTACATCTGGAGCCTAGA^CT^GTAOVGCAAGGTCCTGGCTCTGTACAACCAGC 


1 ORS 


Db 


1844 


1 MM M MMM MM MMMMMM II II llllllll 1 

GCCCCTATATTTGGAGCCTGGAC^aC^GTAC^GC^AGGTCCTTGCCCTCTACAACCAAC 


1903 


Qy 


1086 


ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 


1145 


Db 


1904 


1 1 II 1 1 1 1 1 1 II 1 MM 1 II III II M M 1 1 M III II imillllllllll 

ACAACCCGGGCGCTTCGGCGTCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCA 


1963 


Qy 


1146 


TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 


1205 



MM lllllllllll I M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill I 



Db 



1964 TCGTCTACTACGTGGGTCGC^GCCOVAGGTGGAGCAGTTGTCCAACATGATTGTGCGCT 2 023 



Qy 12 06 CCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGC 1265 

MIIIIIIIIIMMII IIIIMIIIIII Ml 1 1 1 IIIIMIMMIIII 

Db 2024 CCTGCAAGTGCAGGTGAAGCCCCGCCCCGC CCCGCCCCTCCCGGCAGGCCCGGC 2077 

Qy 12 66 CCCACCCCCGCCC 1278 

III lllllllll 

Db 2 07 8 CCCGCCCCCGCCC 2 090 



RESULT 6 

US-09-948-002-27 

/ Sequence 27, Application US/09948002 
; Publication No. US2 003 0050265A1 
; GENERAL INFORMATION: 
; APPLICANT : Nicholas M. Dean 
APPLICANT: Susan F. Murray 
; TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

/ CURRENT APPLICATION NUMBER: US/09/948,002 

; CURRENT FILING DATE: 2000-09-05 

/ PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,54 6 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 27 

LENGTH: 158 5 

TYPE: DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (413) ... (1585) 
US-09-948-002-27 

Query Match 62.2%; Score 839.4; DB 11; Length 1585; 

Best Local Similarity 81.9%; Pred. No. 7.2e-216; 

Matches 997; Conservative 0; Mismatches 181; Indels 39; Gaps 1; 
Qy 6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 65 

1 1 III MM IMMMMIIMI M II M I IMM M M I II M M I 

Db 4 08 CCCCCATGCCGCCCTCGGGGCTGCGGCTGCTGCCGCTTCTGCTCCCACTCCCGTGGCTTC 4 67 

Qy 66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

IMIMIMIMI M MM Mill Mill 1 1 1 1 II 1 1 II 1 1 1 1 1 II I II M II I 

Db 4 68 TAGTGCTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGG 527 

Qy 126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

iiiiiiiiii J 1 1 1 1 r 1 1 1 f r j 1 1 Mm m iiiiMii iiiiMii ii iii 

Db 528 AGCTGGTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGC 587 

Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I IMM MIMIMIMMMM M Mill MMMMIM Mill M M I 

Db 58 8 TCGCCAGTCCCCCGAGCCAGGGGGAGGTACCGCCGGGCCCGCTGCCCGAGGCGGTGCTCG 64 7 



Qy 246 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 305 

II I llllllll llllllllllllll II II II II I II llllllllllll 

Db 64 8 CTTTGTACAACAGCACCCGCGACCGGGTGGCAGGCGAGAGCGCTGACCCGGAGCCCGAGC 707 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 3 65 

I IIIIIIMIMIIIIIIIII II Illlllllll i: .11 II I I I I 

Db 7 08 CCGAGGCGGACTACTACGCCAAAGAAGTCACCCGCGTGCTAATGGTGGACCGCAACAACG 767 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 4 25 

I II II I II I III 1 1 1 Mill Mill MMIIII I Mill Mill I 

Db 768 C^TCTATGAC^U^CCAAAGAC^TCACAC^CAGTATATATATGTTCTTCAATACGTC^G 82 7 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 4 8 5 

I I llllllll Mill Mill MUM II II 1 1 1 1 M I Ml I II I II II 

Db 82 8 ACATTCGGGAAGCAGTGCCAGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCCTGCAGA 8 87 

Qy 4 8 6 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

I llllll M MMIIII Mill II II M M M I II II Mill MUM 

Db 88 8 GATTCAAGTCAACTGTGGAGCAACACGTAGAACTCTACCAGAAATATAGCAACAATTCCT 94 7 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 605 

MM Mill I II II I M I M II llllll Ml I M IMMMMM MM 

Db 94 8 GGCGTTACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTG 1007 

Qy 606 ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 665 

I Mill llllllll 1 1 1 1 f 1 1 1 1 1 1 1 1 M MM I Ml MM llllll 

Db 1008 ACGTCACTGGAGTTGTCCGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGCT 1067 

Qy 666 TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGT 72 5 

1 1 1 1 1 1 1 MM II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMMMMIMI M Ml 

Db 1068 TCAGTGCTCACTGCTCTTGTGACAGCAAAGATAATGTACTCCACGTGGAAATCAATGGGA 112 7 

Qy 72 6 TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 785 

III II MM Illlllllll llllll II I MINIMI Illlllllll 

Db 112 8 TCAGTCCCAAACGTCGAGGTGACCTGGGCACCATCCATGACATGAACCGACCCTTCCTGC 1187 

Qy 78 6 TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 84 5 

1 1 1 ! 1 1 1 . Mill! 1 1 Mill Mill II I !, 1 1 1 1 M 1 1 1 1 1 1 lllllll M 

Db 118 8 TCCTCATGGCCACCCCCCTGGAAAGGGCTCAACACCTGCACAGCTCCAGGCACCGGAGA- 124 6 

Qy 84 6 GCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCT 905 

Illlllllllllllllllllll 
Db 124 7 GCCCTGGATACCAACTACTGCT 1268 

Qy 906 TCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACC 965 

IMIIIMM llllllllllllll II llllllll IMMMMM lllllllll 

Db 126 9 TCAGCTCCACAGAGAAGAACTGCTGTGTACGGCAGCTGTACATTGACTTTAGGAAGGACC 1328 

Qy 966 TGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCT 102 5 

MM IMMMMM M M II II I II I II 1 II II 1 1 II I Mill Illlllllll 

Db 132 9 TGGGTTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGGCCCT 138 8 

Qy 102 6 GTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC 108 5 

I MMIIII llllllll Mill 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 II II llllllll I 

Db 13 8 9 GCCCCTACATTTGGAGCCTGGACACACAGTACAGCAAGGTCCTTGCCGTCTACAACCAAC 144 8 

Qy 108 6 ACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCA 114 5 



iiiiiiiiii ii ii ii i iiiiiiiiiiiiiiiiiiii rv, Mi h i 

Db 144 9 AO^CCCGGGTGCTTCCG(^T(^CCGTGCTGCGTGCCG(^GGCTTTGGAGCCACTGCCCA 15 08 

Qy 114 6 TCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTT 1205 

1 1 1 1 MINI II III MINIMI llllll 1 1 MM 1 1 II M II II 1 1 1 1 1 1 1 II I 

Db 150 9 TCGTCTACTACGTGGGTCGCAAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCT 1568 

Qy 12 06 CCTGCAAGTGCAGCTGA 1222 

I I I I I I I I I I I I 11 I I I 
Db 1569 CCTGCAAGTGCAGCTGA 15 8 5 



RESULT 7 

US-09-756-283A-19 

Sequence 19, Application US/09756283A 
Patent No. US20020151478A1 
GENERAL INFORMATION: 
APPLICANT: Cherna: ovsky , Yuti 
APPLICANT: Dre j a , Hanna Stina 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 
CURRENT APPLICATION NUMBER : US/09/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patent In version 3.0 
SEQ ID NO 19 
LENGTH: 13 76 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: LAP-mlFNbeta construct 
NAME/ KEY: CDS 
LOCATION: (1) . . (1368) 
US-09-756-283A-19 

Query Match 48.5%; Score 654; DB 10; Length 1376; 

Best Local Similarity 88.1%; Pred. No. 5.3e-166; 

Matches 724; Conservative 0; Mismatches 95; Indels 3; Gaps 1, 
Qy 11 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 70 

III MM II IMIMIMM .HI,, MMIMMIMM M Ml 

Db 1 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 60 

Qy 71 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 130 

1 1 1 1 1 1 II 1 1 ! 1 1 lllllllll Mill MMIMMIMM I II II 1 1 1 1 1 1 1 1 1 1 

Db 61 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 120 

Qy 131 GTGAAG CGGAAG CG CATCGAGG C CATTCG CGG CCAGATTCTGTCCAAG CTT CGG CTTG CC 190 

MMMMMMMMIMMMMI MMMMMI MMMMMI Mill Ml 

Db 121 GTGAAG CGGAAG CG CATCGAGG CCATCCG CGG CC^GATCCTGTCCAAGCTG CGG CTCGC^ 180 

Qy 191 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 250 

I M 1 1 1 1 1 1 1 1 1 1 M I! 1 1 1 i l i . i 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 Mill II II II II 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 240 



Qy 



251 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 



Db 



241 



MINIM MMMMMMM MMMM MM MMMMMMMMI Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 3 00 



GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 370 

M MMMMMMMMMMMMMMMMIMMMMM I MM MMI 

GCCGACTACTAGGCCAAGGAGGTCACCGGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 4 30 

MMI II MUM I I llllll MMMM I MMMM II MINI 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 4 2 0 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT - - - GAGG 4 87 

II MMMM II Mill II MMMM MMMMMMM Mill MM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 4 8 0 
CT CAAGTTAAAAG TGGAG CAG CACGTGG AG CTAT ACCAGAAAT ACAG CAATG ATTCCTGG 547 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 MMMM 

CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 54 0 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill II llllll I II llllll 



GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 667 

II 1 1 1 1 1 1 1 1 II I II 1 1 1 1 II M I Ml Ml Ml II II Mill MMMM 

GTCACCGGAGTTGTGCGGCAGTGGTTGAGCGGTGGAGGGGAAATTGAGGGGTTTCGCCTT 660 
AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 727 

II MINI MM MMMM 1 1 1 1 1 1 1 1 1 1 1 M Mill II 'MMMM 

AGCGCGCACTGCTCCTGTGACAGCAGGGATAACACAGTGCAAGTGGACATCAACGGGTTC 72 0 
AATTCTGGCCGCCGGGGTGACCTGGGCACCATTCACGGCATGAACCGGCCCTTGCTGGTC 787 

I I I MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMMM MMMM 

ACTACGGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCGTTTCCTGCTT 78 0 
CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACGTGCACAGC 82 9 

1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Ml 

CTCATGGCCACCGCGCTGGAGAGGGGCCAGCATCTGCAAAGC 822 



RESULT 8 

US-09-756-283A-21 

Sequence 21, Application US/09756283A 
Patent No. US20020151478A1 
GENERAL INFORMATION: 
APPLICANT: Cherna jovsky , Yuti 
APPLICANT: Drej a , Hanna St ma 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 

CURRENT APPLICATION NUMBER: US/09/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 21 
LENGTH: 13 52 
TYPE: DNA 



Qy 


311 


Db 


301 


Qy 


371 


Db 


361 


Qy 


431 


Db 


421 


Qy 


488 


Db 


481 


Qy 


548 


Db 


541 


Qy 


608 


Db 


601 


Qy 


668 


Db 


661 


Qy 


728 


Db 


721 


Qy 


788 


Db 


781 



ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: mIFNbeta-LAP construct 
NAME/ KEY: CDS 
LOCATION: (1) . . (1344) 
US-09-756-283A-21 

Query Match 43.8%; Score 591.4; DB 10; Length 1352; 

Best Local Similarity 87.0%; Pred. No. 3.7e-149; 

Matches 662; Conservative 0; Mismatches 96; Indels 3; Gaps 1; 

Qy 73 GACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGT 13 2 

III II I III I II I III llllllllllllll lllllllllllllllll 
Db 582 GGCGGGAGGGGGCTCAGCGGCCGCACTATCCACCTGCAAGACTATCGACATGGAGCTGGT 641 

Qy 133 GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAG 192 

II! M i 1 1 1 1 h 1 1 1 1 1 1 1 M I, MINIUM 1 1 1 1 1 1 1 1 1 1 1 lllll Mill 

Db 642 GAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAG 7 01 

Qy 193 CCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTA 2 52 

Ml I ! M 1 1 1 M 1 1 1 1 1 1 I ; 1 1 1 1 M M M I : Illlll lllll II II II M II 

Db 7 02 CCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTA 761 



Qy 253 CAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGC 312 

Illlll MIMIIMIMM llllllll MM 1 1 1 1 r 1 1 1 1 1 1 f ! 1 1 1 1 Mill 

Db 762 CAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGC 821 

Qy 313 GGACTACTACGCC^GGAGGT^CCCGCGTGCTAATGGTGGAAAGCGGQy^CC^AATCTA 372 

M II II I II I I I I II II I II I II I I I II I I I I I I I II II I M I I MM lllllll 
Db 822 CGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTA 881 

Qy 373 TGATAAATTC^GGGC^CCCCCCAC^GCTTATATATGCTGTTCAACACGTCGGAGCTCCG 4 3 2 

III II Illlll I I Illlll llllllll I llllllll II llllllll 
Db 882 TGAQVAGTTCAAGCAGAGTACACACAGC^ 941 

Qy 433 GGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGGCT 48 9 

llllllll II Mill II llllllll MIMIIMIMM Mill Illlll 

Db 94 2 AGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGGCT 1001 

Qy 4 90 CT^GTTAAAAGTGGAGCAGCACGTGGAGCTATACC^ 54 9 

I II M M 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 II II II lllllllllllllllll llllllllll 

Db 1002 CAAGTTAAAAGTGGAGC^GCACGTGGAGCTC 1061 

Qy 550 CTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 60 9 

IMMMMMIMMMIMM lllll lllll M Illlll I II llllllll 

Db 1062 ATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 1121 

Qy 610 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 66 9 

IM 1 1 II '1 1 1 II , 1 1 1 II Ml Ml Ml III II II Mill IIIIIIM II 

Db 1122 CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 1181 

Qy 67 0 TGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCa\CGTGGAAATTAACGGGTTa^ 72 9 

lllllll 1 1 1 1 llllllll llllllllll! II lllll II llllllllll 

Db 1182 CGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCAC 1241 

Qy 73 0 TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCT 78 9 

I I llllllll IIMIIIIIIIII III III I llllllllllllll llllllll II 



Db 1242 TACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCT 13 01 



Qy 7 90 CATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 83 0 

I M M I M 1 1 1 1 11 1 1 M 1 1 M I M I M M Mill Mil 

Db 13 02 CATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCT 134 2 



RESULT 9 

US-09-911-904-167 

Sequence 167 , Application US/09911904 
Publication No. US20030096234A1 
GENERAL INFORMATION: 
APPLICANT: Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Neft, Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER : US/09/9 1 1 , 904 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 386 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH: 489 
TYPE: DNA 

ORGANISM: Canis familiaris 
FEATURE : 

NAME/KEY : misc_f eature 
LOCATION: (1) . . . (489) 

OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 

Query Match 27.2%; Score 366.4; DB 11; Length 489; 

Best Local Similarity 86.1%; Pred. No. le-88; 

Matches 445; Conservative 0; Mismatches 32; Indels 40; Gaps 2; 
Qy 774 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 83 3 

i i u 1 1 1 , 1 1 1 ii ii i; i ' i 1 1 m 1 1 1 ii 1 1 ! rri 1 1 ! 1 1 1 ii 1 1 1 1 1 l 1 1 h ii 

Db 1 GACCCTTCCTGCTCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCC 6 0 

Qy 834 GGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATA 8 93 

Mil Mill Illllllll I 

Db 61 GGCAGCGCCG GGCCCTGGACA 81 

Qy 8 94 C CAA CTA CTG CTTCAG CT C CA CGG AG AAGAA CTG CTG CGTG CGG CAG CT CTA CATTGA CT 953 

!. 1 1 1 1 1 1 1 1 1 1 III M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III I M IIIIIIIMM!!!!!!!! 

Db 82 CC^ACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACT 141 

Qy 9 54 TCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCT 1013 

1 1 1 1 Mill I. Ml, Mill' Mill llllllll Mill II II MM 

Db 142 TCCGCAAGGATCTGGGCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCT 2 01 

Qy 1014 GCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTC 1073 

1 1 M 1 1 II 1 1 1 II I f 1 1 1 1 1 1 MMMM Mill MMIIMIMMMIMM I 

Db 2 02 GCCTGGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCC 261 



Qy 


1074 


TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 

IMIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

TGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGG 


1133 


Db 


262 


321 


Qy 


1134 


AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACA 
1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 II 1 1 1 II II 1 II II II M II MM 

AGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACA 


1193 


Db 


322 


381 


Qy 


1194 


TGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCG - CCCACAGCCCCGCCCACC 

1 1 1 r 1 1 1 1 1 1 iiiiiiiiiiiiiiiiiiiiiiiiiiiiii ii mi mi ii i 

TGATCGTGCGCTCGTGCAAGTGCAGCTGAGGCCCCGCCCCGTCCGGCAGGCCCCGCCCAC 


1252 


Db 


382 


441 


Qy 


1253 


CGGCAGGCCCGGCCCCACGCGCGCCCGCCTCACCGGG 128 9 

Illllll llllllll lllllllllll 1 Mill 
CGGCAGGNGCGGCCCCGCCCCCGCCCGCTGCGCCGGG 478 




Db 


442 





RESULT 10 
US-09-813-271B-1 

; Sequence 1, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION : 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834Alartis Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/813 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/02719 

FILING DATE: 12-Jul-95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/ DOCKET NUMBER: 4 -20039C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 339 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : doubl e 
TOPOLOGY : linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu . hTGF-betal (DSM 5656) 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1 . .33 6 

OTHER INFORMATION: /product= "human TGF-betal" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 22.9%; Score 308.6; DB 10; Length 339; 

Best Local Similarity 94.4%; Pred. No. 3.3e-73; 

Matches 32 0; Conservative 0; Mismatches 19; Indels 0; Gaps 0; 
Qy 884 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 943 

MINIM MMIMI IN! h lllllll II III IUI, Mill Ml II I'l 1, 1 1 

Db 1 GCCCTGGACACC^ACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 6 0 

Qy 94 4 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1003 

I I 1 1 1 1 !l 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 ! 1 1 ' I I ! I ! 1 1 II I I 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 120 

Qy 1004 GCC^TTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1063 

Mill MMIMI MINIM 1 1 II 1 1 1 1 MMIMI Mill MINNINN 

Db 121 GCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAG 18 0 

Qy 1064 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1123 

MMIMI NIMMMINN I I : ' I 1 1 I ; ! 1 1 1 ' 1 1 

Db 181 GTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCG 24 0 

Qy 1124 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1183 

M I M ! 1 1 f I M 1 1 I !. M ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 M Mill 1 

Db 241 CAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 3 00 

Qy 118 4 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1222 

I I M M MM M M I Ml I, 1 1 1 1 M II 1 1 1 1 1 1 M M 

Db 301 CTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 33 9 



RESULT 11 
US-09-906-158-3 

; Sequence 3, Application US/09906158 

; Publication No. US20030078217A1 

; GENERAL INFORMATION: 

; APPLICANT: Brett P. Monia 

; APPLICANT: Susan M. Freier 

; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

; FILE REFERENCE: RTS-0257 

; CURRENT APPLICATION NUMBER: US/09/906,158 
; CURRENT FILING DATE: 2001-07-14 
; NUMBER OF SEQ ID NOS : 168 



SEQ ID NO 3 
LENGTH: 25 74 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (254) . . . (1492) 
US-09-906-158-3 

Query Match 17.7%; Score 238.4; DB 11; Length 2574; 

Best Local Similarity 53.6%; Pred. No. 4.1e-54; 

Matches 668; Conservative 0; Mismatches 521; Indels 57; Gaps 6; 
Qy 36 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 95 

1 1 1 III I I III II Ml MINI II I I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320 

Qy 96 GACTGTCOVCCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 155 

MINIM MM Ml I Ml I I I I III MM I I M MM 

Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGC^ 380 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

II I M Mill II MUM MM 1 1 1 1 1 1 1 1 1 1 I II II 

Db 381 TTAGGGGAC^GATCTTGAGCAAGCTCAGGCTC^CC^GCCCCCCTGAGCC^ACGGTGATGA 44 0 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 27 5 

I I II I II I II IMM I I I I I I I I I I I Mill II I I I I 

Db 4 41 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 276 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 32 0 

II II II MM I I I III MM II 1 1 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 321 A CG CCAAGG AGGTCA C C CG CGTG CTAATGGTGG AAAG CGG CAA C CAAAT CTATG ATAAAT 3 8 0 

I Mill MM I III I I I I I I II 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGG 614 

Qy 381 TCAAGGGCACCCCCGACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 0 

III I I I I I III I III II! 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 441 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 50 0 

II III II II Mill Mill I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 
Qy 501 TGGAGCAGCACGTGGAG CT ATAC CAGAAATA CAG CAATG 53 9 

Ml I I I I II II I I I II II 

Db 735 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 

Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 5 99 

MiMi iii ii iii ii i i ii i i 1 1 1 1 r 1 1 f 1 1 

Db 7 95 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 8 54 

Qy 600 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 65 9 

MMMMIMM I MMM IMMIII I MIMI I I III 

Db 855 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 



Qy 660 TTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACA CACTCCACGTGG 713 

I I I I I I I I I I I I I III I I II III I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 714 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 773 

I III III I I I I II III II III II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 774 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 8 33 

I III II I I I I III I I I I I II I 

Db 103 5 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 1094 

Qy 834 GG CACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAA GG 884 

I III I II II I I III III I I II II 

Db 10 95 TG ATGATT CC C C CA CA C CGG CT CGA CAA CC CGGG C CAGGGGGGT CAGAGG AAG AAG CGGG 1154 

Qy 885 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 94 4 

I MM Mill MINIMI II I MM MINIUM Mill I MM 

Db 1155 CTTTGGACACC7VATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 94 5 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 1004 

IMMMIMIM MM 1 1 1 1 1 1 1 : 1 1 ! 1 1 I lh IMIIMM III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274 

Qy 1005 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1064 

MM II 1 1 1 1 M M II II Ml II I II MUM Mill II 

Db 12 75 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1065 TC CTGG CT CTGTACAAC CAG CACAAC C CGGG CG CGT CGG CGG CG CCGTGCTGCGTG CCG C 1124 

I MM IIIMIMI I Mill I II II M MM IMMMIMI I 

Db 133 5 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 13 94 

Qy 1125 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1184 

Ml 1 1 1 1 1 F 1 1 III Mill ! 1 , 1 1 1 1 MM II Mill ' 1 1 1 1 1 M 1 1 

Db 13 95 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 14 54 

Qy 1185 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 123 0 

I IMIIMM I III M II II II II II I II MM I 

Db 14 55 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGAGACCCCAC 1500 



RESULT 12 
US-10-028-158-20 

; Sequence 20, Application US/10028158 

; Publication No. US200201 10833A1 

; GENERAL INFORMATION: 

; APPLICANT: Caniggia, Isabella 

; APPLICANT: Post, Martin 

; APPLICANT: Lye, Stephen 

; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF 

; TITLE OF INVENTION: TROPHOBLAST 

; FILE REFERENCE: 11757. 38USWO 

; CURRENT APPLICATION NUMBER: US/10/028,158 

; CURRENT FILING DATE: 2001-12-20 

PRIOR APPLICATION NUMBER: US/09/38 0 , 662 
; PRIOR FILING DATE: 1999-12-21 
; PRIOR APPLICATION NUMBER: PCT/CA98 /0018 0 



PRIOR FILING DATE : 1998-03-05 
PRIOR APPLICATION NUMBER : US 60/039,919 
PRIOR FILING DATE; 1997-03-07 
NUMBER OF SEQ ID NOS : 24 
SOFTWARE : Patentln version 3.0 
SEQ ID NO 2 0 
LENGTH: 2 574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY: CDS 
LOCATION: (254) . . (1492) 
US-10-028-158-20 

Query Match 17.7%; Score 238.4; DB 13; Length 2574; 

Best Local Similarity 53.6%; Pred. No. 4.1e-54; 

Matches 668; Conservative 0; Mismatches 521; Indels 57; Gaps 6; 

Qy 3 6 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 95 

III III I I III II I I I IIMII II I I II 

Db 261 TGC^CTTGC^W^GGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320 

Qy 96 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 15 5 

IIIIIIM MM Ml I III I I I I III MM I I M MM 

Db 321 CTCTGTCC^CTTGC^CCACCTTGGACTTCGGCCAC^TC^AGAAGAAGAGGGTGGAAGCCA 38 0 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

II I II Mill M MMM MM MMMMM I M II 

Db 3 81 TTAGGGGACAGATCTTGAG(^GCT(^GGCTCACCAGCCCCCCTGAGCCAACGGTGATGA 44 0 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

MM I II I M Mill IIIIIIIIIII Mill IIMII 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 2 76 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 32 0 

II II II MM I I II II MM MM 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 321 ACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGO^ACCAAATCTATGATAAAT 3 8 0 

I II II I II II I IIMII I II II 

Db 555 ATGCC7W^GAAATC(^TAAATTCGACATGATC(^GGGGCTGGCGGAGCAC^ACGAACTGG 614 

Qy 381 TC^GGGC^CCCCCC^C^GCTTATATATGCTGTTC^C^CGTCGGAGCTCCGGGAAGCGG 44 0 

Ml I I I M III I II I I II 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 441 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 500 

II III II M Mill MM! ! 

Db 675 AAAATAGAACCAACCTATTCCGAGCT^GAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 
Qy 501 TGGAGCAGCACGTGGAG CTATACCAGAAATACAGCAATG 53 9 

Ml II II II II I I I II II 

Db 73 5 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 



Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 599 

IIMII II I II III III I II I I MMMMM 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 8 54 



Qy 600 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 65 9 

'lllllllllll I MINI llllllll I MINI I I III 

Db 8 55 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 660 TTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACA CACTCCACGTGG 713 

I I I II I I I I I II I III I I I I Ml I 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 714 AAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACC 773 

lllllllll I IMIIII III II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 774 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 833 

I III I I I I I I III llllllll 

Db 103 5 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 10 94 

Qy 8 34 GG CACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAA GG 884 

I III I II II I I III III I Ml II 

Db 1095 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 885 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 94 4 

I 1 1 1 1 Mill IMIIII 1 . II I MM llllllllll II I II I MM 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 94 5 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 10 04 

1 1 1 1 1 1 1 1 1 1 1 II MM IIIIIMIIMIIII I 1 1 1 r 1 1 1 1 lllllllll III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 1274 

Qy 1005 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1064 

MM llllll II II II II Ml II I II llllll MUM II 

Db 1275 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1065 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1124 

I MM lllllllll I Mill I II II II MM If 1 1 1 1 1 1 1 1 1 I 

Db 133 5 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 13 94 

Qy 1125 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1184 

III llllllll III II II I IMIIII MM M Mill MIIIIMM 

Db 13 95 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 14 54 

Qy 1185 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGC 123 0 

I Mull | Ml II II II II IMIIII MM I 

Db 14 55 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGAGACCCCAC 15 00 



RESULT 13 
US-09-957-458B-9 

; Sequence 9, Application US/09957458B 
; Publication No. US2 003 0 16627 1A1 
; GENERAL INFORMATION: 
; APPLICANT: Chen, Una 

; TITLE OF INVENTION : Method for growing stem cells 

FILE REFERENCE: P66567US0 
; CURRENT APPLICATION NUMBER: US/09/957 , 458B 
; CURRENT FILING DATE: 2001-09-21 

PRIOR APPLICATION NUMBER: PCT/EP00/08247 



PRIOR FILING DATE: 2000-08-24 
PRIOR APPLICATION NUMBER: EP 99116533 
PRIOR FILING DATE: 1999-08-24 
NUMBER OF SEQ ID NOS : 10 
SOFTWARE: Patent In version 3.2 
SEQ ID NO 9 
LENGTH: 4 382 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Vector for transforming supporting cell with a foreign 
express 

OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 17.3%; Score 233.2; DB 12; Length 4382; 

Best Local Similarity 52.8%; Pred. No. 1.2e-52; 

Matches 660; Conservative 0; Mismatches 533; Indels 57; Gaps 5 
Qy 2 6 CTG CGG CT CTTG C CG CTG CTG CTG CCG CTG CTGTGG CTGCTAGTG CTGACG C CTGG CCGG 8 5 

1 1 1 1 I! I Ml 1 1 1 I I Ml I III IIMM I I 

Db 4 55 CTGCAGCCCATGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCA.CA 514 

Qy 8 6 CCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGC 14 5 

I I IMMIII MM II I III I I I I III MM I 

Db 515 ATCAGCCTCTCTCTGTCCACTTGCAC(^CGTTGGACTTCGGCC^CATa^AGAAGAAGAGG 574 

Qy 14 6 ATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAG 2 05 

I II MMM I II Mill II MUM MM MMMMM I 

Db 575 GTGGAAGCCATTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCA 634 

Qy 2 06 GGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGC 2 65 

1 1 MINI I II I II Mill 1 1 1 1 1 1 1 1 1 1 1 Mill 

Db 63 5 TCGGTGATGACCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGG 688 

Qy 266 GACCGGGTAGCCG GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 310 

M Mil MM I I I II I III 

Db 68 9 GAGTTGCTGGAAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAG 74 8 

Qy 311 GCGGACTACTACGCC^AGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 37 0 

I II Mill Mill III II I III I I I II 

Db 74 9 TCTGAGTACTATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCAC 8 08 

Qy 371 TATGATAAATTCAAGGGC^CCCCCCACAGCTTATATATGCTGTTC^ACACGTCGGAGCTC 43 0 

MM I M II I I II I I M III 

Db 8 09 AATGAACTGGCCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCC 8 68 

Qy 431 CGGGAAG CGGTG C CGG AA CCTG TATTG CTCTCTCGGG CAG AG CTG CG C CTG CTG AGG CTC 4 90 

III I I I MM I I I I Ml I II I 

Db 86 9 TCAGTGGAGAAAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCC 92 8 
Qy 4 91 A -AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAA 52 9 

I Mill MM II M Ml 

Db 92 9 AACCCCAGCTCCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGAT 988 

Qy 53 0 TACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCG 58 9 

I M I IMMM II II MM Ml II I I 



Db 



98 9 GAGCACATAGCCAAGCAGCGCTACATAGGTGGCAA 



1048 



Qy 590 GAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCT 64 9 

II IIIIIIII II IIIIIIII I MINI MINIM I llllll I 

Db 104 9 GAATGGCTGTCTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCC 1108 

Qy 65 0 ATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACA GCAAAGATAAC 700 

I Mill MM Mill I I I MM III I M 

Db 1109 AACTTGGGTCTGGAAATCAGCATCCACTGTC(^TGT(^{^CCTTTCAGCCCAATGGAGAC 1168 

Qy 701 ACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATT 760 

I III I I Mill I I Ml I 

Db 1169 ATACTGGAAAATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGAT 122 8 

Qy 761 CACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCAC 820 

M I M II Mill I I M Ml 

Db 122 9 GACCATGGCCGTGGAGACCTGGGGCGTCTCAAGAAGCAAAAGGATC 12 8 8 

Qy 8 21 CTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGG ATGAC 874 

Ml I I I M II I II III I III I 

Db 12 89 CTGATCCTC^TGATGATCCCCCCAC^CCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGG 134 8 

Qy 875 GACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTG 934 

I I I lllllllll Mill lllllllll II I MM M l MM II 

Db 13 4 9 AAGAAGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTA 14 08 

Qy 935 CGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAG 994 

M I II M MMMMMM MM M IMMMI Ml I M MM Ml 

Db 14 09 CGCCCCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAG 14 68 

Qy 995 GGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAG 1054 

II MMMMMM llllll II III Ml III llllll 

Db 14 69 GGTTACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACC 1528 



Qy 1055 TACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGC 1114 

I Ml III II I IMMMI I llllll MM II MM III 

Db 152 9 CATAGCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGC 158 8 

Qy 1115 TG CGTG C CG CAGG CG CTGGAG C CACTG C C CAT CGTGTA CTACGTGGG C CG CAAG C C CAAG 1174 

MM II MM MUM Ml MM MMI WWW I I WWW 

Db 158 9 TGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAG 1648 

Qy 1175 GTGGAG CAG CTGT C CAA CATG AT CGTG CGTT C CTG CAAGTG CAG CTGAGG 1224 

IIIIIIIIIIIIIIIMMII I III II II MMMMMM 

Db 164 9 GTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGAGG 1698 



RESULT 14 
US-09-906-158-10 

; Sequence 10, Application US/09906158 

; Publication No. US20030078217A1 

; GENERAL INFORMATION: 

; APPLICANT: Brett P. Monia 

; APPLICANT: Susan M . Freier 

; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

; FILE REFERENCE: RTS-0257 



CURRENT APPLICATION NUMBER: US/09/906,158 
CURRENT FILING DATE: 2 001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 10 
LENGTH: 287 9 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (611) . . . (1843) 
US-09-9O6-158-10 

Query Match 17.0%; Score 230; DB 11; Length 2879; 

Best Local Similarity 52.7%; Pred. No. 7.7e-52; 

Matches 655; Conservative 0; Mismatches 530; Indels 57; Gaps 5; 

Qy 36 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 95 

III III MINI I I I MINI II II 

Db 612 TGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACAATCAGCCTCT 671 

Qy 96 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 155 

MINIM MM II I Ml I I I I Ml MM I I II MM 

Db 672 CTCTGTCC^CTTGC^CC^CGTTGGACTTCGGCCACATC^AGAAGAAGAGGGTGGAAGCCA 731 

Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

M I M Mill M MMM MM ! M 1 1 II I M I II II 

Db 732 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCATCGGTGATGA 791 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

I I II I II I M Mill 1 1 II II I M II Mill M III 

Db 7 92 CCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCACCCGGGAGTTGCTGG 84 5 

Qy 276 CCG GGG AAAGTGT CG AACCGG AG C C CG AG C CAG AGG CGG ACT ACT 320 

I MM II III I MM II MM 

Db 84 6 AAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAGTCTGAGTACT 905 

Qy 321 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 38 0 

I Mill Ml M I Mil I I II MM 

Db 906 ATGCCAAAGAGATCCATAAATTCGACATGATCCAGGGACTGGCGGAGCACAATGAACTGG 965 

Qy 381 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 44 0 

I M II I I II II II III III 

Db 966 CCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCCTCAGTGGAGA 1025 

Qy 441 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCA 4 91 

III II II I I I I III I M II 

Db 102 6 AAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 1085 

Qy 4 92 AGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATG 53 9 

M Ml MM I I II II I I M I 

Db 1086 CCAAGCGCACAGAGCAGAGAATTGAGCTCTTCCAGATACTTCGACCGGATGAGCACATAG 114 5 

Qy 54 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 59 9 

MIMM II II III II III I II I II MM 

Db 114 6 CCAAGC^GCGCTACATAGGTGGCAAGAATCTGCCCACAAGGGGCACCGCTGAATGGCTGT 12 05 

Qy 6 00 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 65 9 



I II MINIM I MINI MINIM I MUM I I MM 

Db 12 06 CTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCCAACTTGGGTC 1265 
Qy 660 TTCGCCTCAGTGCCCACTCTTCCTCTGACA G CAAAG AT AACACACTCCACG 710 

I MM Mill I I I I III III I Ml III I 

Db 12 66 TGGAAATCAGCATCCACTGTCCATGTCACACCTTTCAGCCCAATGGAGACATACTGGAAA 1325 

Qy 711 TGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGA 77 0 

I I I I I I I I Ml I M 

Db 132 6 ATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGATGACCATGGCC 1385 

Qy 771 ACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 83 0 

I I I M MM I I I M MUM I I 

Db 1386 GTGGAGACCTGGGGCGTGTCAAGAAGCAAAAGGATCACCACAACCGA.CACCTGATCCTC^ 144 5 

Qy 831 CCGGGCACCGCCGAGCCCTGGATACCAACGACTACAAGG ATGACGACGACAAGG 884 

I II M I II III I III I I I I M 

Db 1446 TGATGATCCCCCCACACCGACTGGACAGCCCAGGCCAGGGCAGTCAGAGGAAGAAGAGGG 1505 

Qy 885 CCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCT 944 

IMMM Mill 1 1 1 1 1 1 1 1 1 1 1 I MM 1 1 r 1 1 1 J 1 1 1 M III III 

Db 15 06 CCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTACGCCCCGTTT 1565 

Qy 945 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 1004 

I MM MINI MM II M.N I III I II Mill Mill Ml III 

Db 1566 ATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAGGGTTACTATG 1625 

Qy 1005 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 1064 

MM MMM II M II II Ml M I III IIMM I MM II 

Db 1626 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACCCATAGCACGG 1685 

Qy 1065 TCCTGGCTCTGTACAAGCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1124 

I II I II MMM I Mill I Mill II MM IMMMI II I 

Db 168 6 TGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGCTGCGTCCCCC 174 5 

Qy 1125 AGGCGCTGGAGCCACTGCGCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1184 

Ml IMMMI III Mill IMMM MMM I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1746 AGGACCTGGAGCGCCTGACCATCTTGTACTATGTGGGCAGAACCGGCAAGGTGGAGCAGC 18 05 

Qy 1185 TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGGTGAGGCC 1226 

1 1 II I II M M I Ml II II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

Db 18 06 TGTCCAACATGGTGGTGAAGTGGTGTAAGTGCAGCTGAGGCC 184 7 



RESULT 15 
US-09-813-271B-7 

; Sequence 7, Application US/09813271B 
; Patent No. US2 002 011583 4A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES : 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US2 002 0 1 15834Alart is Patent Department 
STREET: 564 Morris Avenue 



CITY: Summit 

STATE: New Jersey 
; COUNTRY: USA 

; ZIP: 07901 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patent In Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/ 02 71 9 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul~94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/DOCKET NUMBER: 4 -2 003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA of 
IMMEDIATE SOURCE: 

CLONE: E. coli LC13 7/pPLMu . TGF-betal ( 44 /4 5 ) beta3 
FEATURE : 

NAME /KEY : mat_peptide 

LOCATION: 1 . . 132 

OTHER INFORMATION: /product= "N- terminal 44 ammo 
acids of human TGF-betal" 
FEATURE: 

NAME/KEY: mat_peptide 
LOCATION: 133.. 336 

OTHER INFORMATION: /product = M C~ terminal 68 amino 
acids of human TGF-beta3" 
FEATURE : 

NAME /KEY: CDS 
LOCATION: 1..336 

OTHER INFORMATION: /product^ "hybrid TGF-beta named 
TGF-betal -3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 

Query Match 16.8%; Score 227.2; DB 10; Length 336; 

Best Local Similarity 79.8%; Pred. No. 2.6e-51; 

Matches 268; Conservative 0; Mismatches 68; Indels 0; Gaps 0; 



Qy 8 84 GCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTC 943 



Ill III IIIIIIIIIIIMIIIIIIIIIIIIMIMIIIIIIMIM 

Db 1 GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTG 6 0 

Qy 944 TACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCAT 1003 

MINI hlllll MINIM Ml MINIM M II 1 1 II 1 1 1 1 1 1 M 1 1 1 

Db 61 TACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCAT 12 0 

Qy 1004 GCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAG 1063 

IMII MINI M M II II III II I II MUM MUM I 

Db 121 GCC^CTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACG 18 0 

Qy 1064 GTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCG 1123 

II 1 1 1 1 MINIMI I Mill I II II II MM 1 1 1 1 1 [ 1 1 1 1 1 

Db 181 GTGCTGGGAGTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCC 24 0 

Qy 1124 CAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAG 1183 

MM 1 1 1 1 1 1 1 1 IN INN 1 1 1 1 1 ! I II II II INN MM MM 

Db 241 CAGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAG 3 00 

Qy 1184 CTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGC 1219 

II 1 1 1 1 1 1 1 1 1 I Ml II II M II III 

Db 3 01 CTCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGC 336 



Search completed: October 28, 2003, 09:04:27 
Job time : 409.106 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on; 



Title: 



October 27, 2003, 10:47:27 ; Search time 3347.62 Seconds 

(without alignments) 
9794.056 Million cell updates/sec 

US-10-017-372E-32 



Perfect score: 1349 



Sequence : 



1 tggtaccgagatggcgcctt cgattaaagcggccgcgact 134 9 



Scoring table: I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



45562784 



Database : EST:* 

1 : em estba : * 



2: em_esthum:* 

3 : em_est in : * 

4: em_estmu:* 

5: em_estov : * 

6: em_estpl:* 

7: em_estro:* 

8: em_htc. * 

9: gb_estl:* 

10: gb_est2:* 

11: gb_htc.-* 

12: gb_est3:* 

13: gb_est4:* 

14: gb_est5:* 

15: em_estfun : * 

16: em_estom:* 

1 7 : em_gs s_hum : * 

18: em_gss__inv : * 

19: em_gss_pln:* 

20: em_gss_vrt:* 

21: em_gss_fun: * 

2 2 : em_gs s_mam : * 

23: em_gss_mus:* 

24: em_gss_pro:* 

25: em__gss_rod : * 

2 6 : em__g s s_phg : * 

27: em_gss_vrl:* 

28: gb_gssl:* 

29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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ALIGNMENTS 



RESULT 1 
BM562135 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



1072 bp mRNA 
_MGC_118 Homo sapiens 



linear EST 20-FEB-2002 
cDNA clone IMAGE : 574 5463 



BM562135 

AGENCOURT_656203 2 NIH_ 
5 1 , mRNA sequence . 
BM562135 

BM562135 . 1 GI : 18807966 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1072) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih.gov 

Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl . gov 

Plate: LLAM12768 row: h column: 08 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



(MGC) 



FEATURES 

source 



BASE COUNT 
ORIGIN 



High quality sequence start: 9 
High quality sequence stop: 647. 

Locat ion/Qua 1 i f iers 

1. .1072 

/organism="Homo sapiens" 

/ mo 1 _ t yp e = " mRNA " 

/db_xref ="taxon: 9606" 

/clone*" IMAGE: 5745463" 

/ 1 i s s ue_t ype = " 1 eukocy t e " 

/lab_host="DH10B" 

/ c 1 one_l i b= " N I H_MGC_1 1 8 " 

/note="Vector : pCMV-SPORT6; Site__l : Not I; Site_2: EcoRV 
(destroyed) ; RNA source leukocytes from anonymous pool of 
non-activated adult donors. Library is oligo-dT primed 
and directional ly cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
1.2-3.3 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 027. Note: 
this is a NIH^MGC Library." 
217 a 356 c 313 g 186 t 



Query Match 53.0%; Score 715.6; DB 12; Length 1072; 

Best Local Similarity 82.5%; Pred. No. 2.4e-145; 

Matches 886; Conservative 0; Mismatches 144; Indels 44; Gaps 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 



127 GCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCT 18 6 

M 1 , 1 1 1 1 1 1 1 1 1 1 1 1 1 'I II MIIMI IIIIIIIIMI MINIMI! Mill 

2 0 GCTGGTGAAGCGGAAGCGCATCTAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCT 7 9 
18 7 TGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGC 246 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II M 

8 0 CGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGC 13 9 
24 7 TCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCC 3 06 

II 1 1 1 1 r 1 1 1 1 1 1 1 r 1 1 1 1 M 1 1 1 MINIM MM 1 1 1 1 II 1 1 1 II 1 1 M 1 1 

14 0 CCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCC 199 
3 07 AGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCA 366 

INN ! 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I INI I 

2 00 TGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGA 2 59 
367 AATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGA 4 2 6 

MINIMI M MINI I I MINI MINIM I 1 1 1 1 1 1 1 1 II II 

26 0 AATCTATGACAAGTTC^GCAGAGTACAC^CAGCATATATATGTTCTTCAACACATC^GA 319 
42 7 GCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAG 48 6 

MINI I'M i: N INN II II, Nil l M 1 1 M 1 1 1 1 1 i i llllllll 

32 0 GCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAG 37 9 
48 7 GCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTG 54 6 

IIIIIMMIIMIIIIIIIMIIIMIIIMI IIIIIIIIIIMIIIII lllllll 

38 0 GCTCTUVGTTAAAAGTGGAGCAGCACGTGGAGCTGTACC^GAAATAC^GC^CAATTCCTG 43 9 
547 GCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGA 606 

III lllllll II I II I MINI III I Mill Mill II MINI I II Mill 



Db 



44 0 GCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGA 4 99 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



6 07 TGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCT 666 

M M M I II II M II M I II 1 1 1 II III III Ml II 1 1 Mill IMIMII 

500 TGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCT 559 
667 CAGTGCCCACTGTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT 726 

II MMMI IMI IMIMII 1 1 1 1 1 1 1 1 1 1 1 II Mill II IMIMII 

560 TAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTT 619 
727 CAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCT 786 

II I I Ml IMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I M 1 1 1 1 1 1 1 M 1 1 IMIMII 

620 CACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCT 67 9 



787 



680 



CCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGGCGAGC 

M 1 1 1 M M II M II II II I M M M M II M Mill 1 1 M I II 1 1 1 II I II II I II 

TCTCATGGCCAGCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGC 



846 



739 



847 CCTGGATACCAACGACTACAAGGATGACGAGGACAAGGCCCTGGATACCAACTACTGGTT 906 

MMMI MMMI Mill 

74 0 CCCTGGACACCAACTATTGCTT 761 



907 



966 



CAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCT 

MMMI 1 1 : 1 1 1 1 : 1 1 . 1 1 1 1 1 1 1 1 1 . 1 ^ 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I MUM 

762 CAGCTCCCCGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAGGGACCT 821 
967 GGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTGTGCCTGGGGCGCTG 1026 

M 1 1 1 1 1 1 1 1 1 1 1 1 II II III I 1 1 1 1 1 1 1 1 1 II I IMIMII IV 

8 22 CGGCTGGAAGTGGATCCACGAGCCCCAAGGCTACCATGCCCACTTCTGCCTCGGGCCCTG 8 81 
1027 TCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCA 1086 

I III: IMIMII Mill I MMMI MIMI MMMI II II 

CCCCTACATTTGGAGCCTGGACACGCGTACCAGCAAGTCCCTGGCCCTGTACACCCGCCA 



882 



941 



1087 CAACCCGGGCGCGTCGGCGGCGCCGTGCTGC - -GTGCCGCAGGCGCTGGAGCCACTGCCC 1144 

MM II I 1 1 1 1 1 M 1 1 1 I III I M I I Mill II MIMI 

942 TAACCGGGCGCCCTCGGCGGCGCGGGTTTGCTGGGCCCCAGGCGCCTGGACGCCCTGCCC 1001 
1145 AT- -CGTGTACTACGTGGGCCGC- -AAGCCCAAGGTGGAGCAGCTGTCCAACAT 1194 

I I MMMI IMI Ml I MIMI III 1 1 MIMI MM 

1002 CTTCGGGGTACTACCTGGGGCGCCAAGCCCCAAGTGGGAACACCTGTCCCACAT 1055 



RESULT 2 

BX355682/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BX355682 1201 bp mRNA linear EST 05-MAY-2003 

BX355682 Homo sapiens PLACENTA COT 25 -NORMALIZED Homo sapiens cDNA 
clone CSODI002YJ05 3-PRIME, mRNA sequence. 
BX355682 

BX3 55682. 1 GI: 3 03 7198 7 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1201) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Full-length cDNA libraries and normalization 

Unpublished 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / /www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0DI002CE03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DI002CE03NP1 . 

Locat ion/Qua 1 if iers 

1. .1201 

/organism="Homo sapiens" 
/mol_type="mRNA " 
/db_xref ="taxon: 9606" 
/clone=" CS0DI002YJ05" 

/tissue_type=" PLACENTA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
198 a 326 c 386 g 253 t 38 others 



Query Match 52.7%; Score 710.4; DB 13; Length 1201; 

Best Local Similarity 82.8%; Pred. No. 3.4e-144; 

Matches 902; Conservative 8; Mismatches 126; Indels 53; Gaps 



Qy 


260 


ACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG - GCGGACTA 

1 llllllllllll llllll 1 - lllllllllllllllll III Ihllll 
AACCGCGACCGGGTGGCCGGG- -AGAKKCAGAACCGGAGCCCGAGCCTGAGCCCGRACTA 


318 


Db 


1046 


989 


Qy 


319 


CTACGCCAAGGAGGT(^CCCGCGTGCTAATGGTGGAAAGCGG(^ACCAAATCTATGATAA 


378 


Db 


988 


MllllllllllllhllllMIMMMIIIIMIII 1 MM 1 1 1 1 1 1 1 1 1 1 II 

CTACGCCAAGGAGGTMACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAA 


929 


Qy 


379 


ATTC^GGGOVCCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGC 


438 


Db 


928 


IMMI 1 1 MUM IIMIIII 1 IIIIMM II IMMMI Mill 

GTTCAAGCAGAGTACACACAGCATATATATGTTCT^ 


869 


Qy 


439 


GGTGCCGGAACCTGTATTGCTCT-CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAA 


497 


Db 


868 


Ml II 1 II M II IMMM 1 M 1 1 1 1 1 11 1 1 1 M 1 1 1 II 11 1 1 1 II 1 II 1 II 1 

GGTACCTGAACCCGTGTTGCTCTCCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAA 


809 


Qy 


498 


AAGTGGAGC^GCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCA 

1 M M 1 M M 1 M M 1 1 M 1 II M 1 1 M M 1 1 II 1 M II MIMIMII IMMM 

AAGTGGAGC^GCACGTGGAGCTGTACCAGAAATACAGO^CAATTCCTGGCGATACCTCA 


557 


Db 


808 


749 


Qy 


558 


GCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAG 

1 M II 1 1 1 1 1 II M II Mill Mill II IMMI 1 II 1 1 1 1 1 1 1 1 ! 1 f 1 1 1 1 1 

GCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAG 


617 


Db 


748 


689 



Qy 


618 


TTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 

llllllllllllll III III III II II II 1 1 MINIM II NIMH 

TTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACT 


677 


Db 


688 


630 


Qy 


678 


CTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 

MM IMIIMI MIMIMIM II Mill M IIIMIMM 1 1 MM 

GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 


737 


Db 


629 


570 


Qy 


738 


GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 


797 


Db 


569 


MM IMMMMMMMMMI MMMMMMM MMMM MMMMM 

GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 


510 


Qy 


798 


CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 

1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M Mill 1 ! 1 1 1 1 1 1 1 1 1 1 ' 1 ! 1 . 1 1 1 1 1 1 1 1 1 

CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 


857 


Db 


509 


454 


Qy 


858 


ACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGG 

MIMIII MIIIIMIIIMMI 

ACCAACTATTGCTTCAGCTCCACGG 


917 


Db 


453 


429 


Qy 


918 


AGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTGCGGAAGGACCTGGGCTGGAAGT 


977 


Db 


428 


IMM M M M 1 M M 1 1 1 II M 1 MMMMMMM MMMM MMMMM 

AGAAGNACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGT 


369 


Qy 


978 


GGATTCATGAACCCAAGGGCTACCATGCCAATTTGTGCCTGGGGCCCTGTCCCTACATCT 


1037 


Db 


368 


MM II II IMMMMMMMMMI MMMM IMIIMI MMMM 1 

GGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTT 


309 


Qy 


1038 


GGAGCCTAGAC^CTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCG 

1 II 1 1 II Mill IIIIMIIIIMIIIIIIII MIMMMIMM MMMMM 

GGAGGCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCG 


1097 


Db 


308 


249 


Qy 


1098 


CGTCGGCGGCGGCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACG 


1157 


Db 


248 


1 IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIII MIMIIMMMMMM 

CCTCGGCGGCGCCGTGCTGCGTGCGGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACG 


189 


Qy 


1158 


TGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCA 

MMMMMMMMMMIMIMMIIIMIMMIIIIMM IMIIIIIIIIM 

TGGGCCGCAAGGCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCA 


1217 


Db 


188 


129 


Qy 


1218 


GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCC 


1277 


Db 


128 


1 1 1 It 1 1 MIMMIMM MIMIII Mill MM MM II 

GCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCRGCAGGCCCGGCCCCACCCCGCCCCGCCC 


69 


Qy 


1278 


CGCCT CACCGGGGCTGTATTTAAGGACATCGTGCCCCAAGCCCACTTGGGAT 

1 M II IIIIIIIMMIII MM 1 MIMMMIM 

CCGCTGCCTTGCCCATGGGGGCTGTATTTAA-GACACCCGTCCCCAAGCGCACNNAMNNM 


1329 


Db 


68 


10 


Qy 


1330 


CGATTAAAG 1338 




Db 


9 


-Ihllh 
MDATHAAAS 1 




RESULT 3 
BX335351/ 


c 







LOCUS BX335351 983 bp mRNA linear EST 01-MAY-2003 

DEFINITION BX335351 Homo sapiens PLACENTA COT 2 5 -NORMALI ZED Homo sapiens cDNA 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



clone CS0DI013YH16 3-PRIME, mRNA sequence. 
BX335351 

BX335351 . 1 GI :30308367 
EST . 

Homo sapiens (human) 
Homo sapiens 

Chordata; 
Primates ; 



Eukaryota; Metazoa; 
Mammalia ; Eutheria ; 
1 (bases 1 to 983) 
Li,W.B. , Gruber,C. , 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



and Polayes,D. 



BASE COUNT 
ORIGIN 



Jessee, J . 

Full-length cDNA libraries and normalization 

Unpublished 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 

http : //www. genoscope . ens . fr/cgi -bin/cluster . cgi?seq=CSODI 013DD08NP1 
&cluster=9160 . r . Contact : Feng Liang Email : fliang@lifetech.com 
URL : http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 013DD08NP1 . 

Location/Qualifiers 

1. .983 

/organ ism=" Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone=" CSODI 013 YH16 " 

/tissue_type=" PLACENTA COT 2 5 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
170 a 280 c 323 g 203 t 7 others 



Query Match 51.8%; 
Best Local Similarity 84.4%; 
Matches 857; Conservative 



Score 699.4; DB 13; Length 983 ; 
Pred. No. 7.9e-142; 
5; Mismatches 107; Indels 47; 



Gaps 



5; 



Qy 



Db 



278 GGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACC 337 

Mill 1 1 1 1 IIIIIIMIIIIIIIII INN MMIMIMIMIMMMIMI 

983 GGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCACC 924 



Qy 

Db 

Qy 
Db 



338 CG CGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCCAC 3 97 

II I MM III III lllhll MM MUM I III II MUM I I Ml 

923 CGCGTGCTAATGGTGGARACCCA.CAACGAAATCTATGACAAGTTCAAGCAGAGTACA 8 64 

3 98 AGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTG 4 57 

Ml IIIIMM I llllllll 1 1 IMIMII MIIIMI II Mill II III 

8 63 AGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTG 8 04 



Qy 4 58 CTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 517 

Mill I I I II I I I I I I I I I IIMMMMMIIMMMMIMIMMIMMIMI 
Db 8 03 CTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 74 4 



Qy 


518 


CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTC7VGCAACCGGCTGCTGGCCCCC 

II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 llllllllll II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
CTGTACCAGAAATACAGCAACAATYCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 


577 


Db 


743 


684 


Qy 


578 


AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 


637 


Db 


683 


II 1 II 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II III 1 
AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 


624 


Qy 


638 


CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 


697 


Db 


623 


M Mill 1 1 II 1 1 II M II 1 II 1 1 1 1 1 II 1 1 II 1 1 1 1 1 M II Ml 
CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 


565 


Qy 


698 


AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 


757 


Db 


564 


1 1 1 1 II 1 1 II Mill II II 1 1 1 II 1 1 1 1 1 llllllll 1 1 II 1 II 1 1 1 1 1 M 1 
AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 


505 


Qy 


758 


ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 

lllll 1 II 1 1 1 II 1 1 M II 1 1 II II 1 1 1 II 1 1 II 1 II 1 M II 1 1 II 1 II 1 1 1 1 1 II 1 
ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 


817 


Db 


504 


445 


Qy 


818 


CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGAC 


877 


Db 


444 


II lllll II II 1 II 1 II 1 II 1 II II 

CATCTGCAAAGCTCCCGGCACCGCCGA 


418 


Qy 


878 


GACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 

1 1 1 1 1 1 II 1 1 II 1 II 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 II II 
GCCCTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 


937 


Db 


417 


364 


Qy 


938 


CAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGC 


997 


Db 


363 


II 1 II II 1 1 1 II II II 1 1 1 II M 1 1 1 1 1 1 1 II II II 1 1 II 1 M II IIIIIIIII 
CAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGC 


304 


Qy 


998 


TACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTAC 

1 1 1 1 1 II 1 1 M II 1 1 1 II 1 llllllll 1 II 1 1 1 1 1 1 II 1 II 1 1 Mill MUM 
TACCATGCC^ACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAM 


1057 


Db 


303 


244 


Qy 


1058 


AGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGC 


1117 


Db 


243 


: II II 1 1 1 M M 1 M II II 1 1 1 1 1 II 1 lllll Mill 1 II 1 II 1 1 1 1 1 1 1 1 1 
MMCAAGGTCCTGGCCMTGTACAACCAGCATAACCC - GGCGCCTCGGCGGCGCCGTNGT - C 


186 


Qy 


1118 


GTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 


1177 


Db 


185 


IMIIIIIIMIIMIIIII IMMIMMIMIIMMIMM 1 1 1 1 1 II 1 1 1 1 1 1 1 

GTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCTGCAAGCCCAAGGTG 


126 


Qy 


1178 


GAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCC 


1237 


Db 


125 


IIIIIIIIMIIIIIIIIMIMIM MIIIIIIMMIIIIIMI II 1 1 1 1 M 1 II 

GAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGAGGTCCCGCCCCGCC- 


67 


Qy 


1238 


ACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGGCTG 1293 


Db 


66 


II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
CCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTG 1 5 





RESULT 4 
BX349319 

LOCUS BX349319 900 bp mRNA linear EST 05-MAY-2003 



DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX349319 Homo sapiens B CELLS (RAMOS CELL LINE) COT 25 -NORMALIZED 
Homo sapiens cDNA clone CS0DL010YL07 5 - PRIME , mRNA sequence. 
BX349319 

BX349319 . 1 GI :30379410 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 900) 

Li,W.B., Gruber,C. f Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http ; / /www. genoscope . ens . f r/ 

cgi -bin/cluster. cgi?seq=CS0BAG059ZD04_CS05596__l&cluster=9160 . r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAG059ZD04_CS05596_1 . 

Location/Qualif iers 

1. .900 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone= " CS0DL01 0YL07 » 

/cell_type="B CELLS (RAMOS CELL LINE) COT 2 5 -NORMALIZED" 
/cell_line="RAMOS CELL LINE" 

/clone_lib="Homo sapiens B CELLS (RAMOS CELL LINE) COT 
25 -NORMALIZED" 

/note=" 1st strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 292 c 270 g 155 t 4 others 



Query Match 51.1%; 
Best Local Similarity 87.6%; 
Matches 753; Conservative 



Score 690; DB 13; Length 900 ; 
Pred. No. 8.5e-140; 
0; Mismatches 107; Indels 0; 



Gaps 



0, 



Qy 

Db 

Qy 
Db 

Qy 

Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III MM II llllll Mill MM Mill MM II MM !!!!!!!! I 

3 0 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 8 9 

66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 12 5 

I MIIIMMIIIIMIIIIMMMI Mill 1 1 1 1 1 1 1 1 1 1 1 II I IMIIIIIII 

90 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 14 9 

126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

MMMMMMMMMIIMMMMMI MIIMMIM MMMMMI MM 

150 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 2 09 



Qy 


186 


TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 llllllllllllllllllll 1 1 1 1 1 II II 1 
TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 


245 


Db 


210 


269 


Qy 


246 


CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 


305 


Db 


270 


1 II llllllll 1 1 1 II 1 II II 1 1 1 1 llllllll 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 
CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 


329 


Qy 


306 


CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 


365 


Db 


330 


1 Mill 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 


389 


Qy 


366 


AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 

Illlllllll II llllll 1 1 llllll llllllll 1 1 II 1 1 1 II II 1 
AAATCTATGACAAGTTC^GC^GAGTACAC^CAGCATATATATGTTCTTCAACACATCAG 


425 


Db 


390 


449 


Qy 


426 


AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 

lllllll llllllll II 1 1 1 1 1 II llllllll llllllllllllll lllllll 
AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 


485 


Db 


450 


509 


Qy 


486 


GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 


545 


Db 


510 


1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II lllllllllllllllll 1 II 1 1 1 
GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 


569 


Qy 


546 


GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTG 


605 


Db 


570 


MM 1 1 II 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 Mill 1 1 1 1 1 II llllll 1 II 1 1 1 1 
GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTG 


629 


Qy 


606 


ATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCC 


665 


Db 


630 


1 1 1 II II 1 II 1 II 1 1 II 1 II 1 1 II II 1 1 1 1 II 1 1 1 1 1 II 1 1 II 1 1 1 II 1 1 1 
ATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCC 


689 


Qy 


666 


TCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAAGGGGT 


725 


Db 


690 


1 1 1 II 1 II 1 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 II II II 1 1 II 1 1 1 1 1 1 1 II 1 
TTAGCGCCCACTGCTCCTGTGAC^GCAGGGATAAC^C^CTGC^GTGGACATCAACGGGT 


749 


Qy 


726 


TCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGC 
iii i i iiiiiiii iiiiiiiiiiiiiiiiiiii 11111111111111 ii i i i i 


785 


Db 


750 


M 1 1 1 1 II 1 1 II 1 1 1 1 1 1 II II 1 1 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 II II 1 II II 1 1 
TCACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTTCTGC 


809 


Qy 


786 


TCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAG 

1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M II II Mill IMIIIIIIIIIII MM 

TTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGNCGAG 


845 


Db 


810 


869 


Qy 


846 


CCCTGGATACCAACGACTAG 865 




Db 


870 


MUM 1 II II 1 
CCCTGGNACACCACTATTGC 88 9 





RESULT 5 
BX324511/C 

LOCUS BX324511 1041 bp mRNA linear EST 02-MAY-2003 

DEFINITION BX324511 Homo sapiens NEUROBLASTOMA COT 25 -NORMALI ZED Homo sapiens 

cDNA clone CS0DC024YD20 3-PRIME, mRNA sequence. 

ACCESSION BX324511 

VERSION BX324511.1 GI:30332381 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1041) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSOAC024DB10NP2&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSOAC024DB10NP2 . 

Location/Qual if iers 

1. .1041 

/organism="Homo sapiens" 
/ mo 1 __type = " mRNA 11 
/db_xref ="taxon: 9606" 
/cl one= " CS 0DC024YD2 0 " 

/tissue_type= "NEUROBLASTOMA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
176 a 283 c 346 g 219 t 17 others 



Query Match 48.8%; Score 658; DB 13; Length 1041; 

Best Local Similarity 81.3%; Pred. No. 8.1e-133; 

Matches 868; Conservative 10; Mismatches 135; Indels 55; Gaps 



8; 



Qy 


279 


GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCC 


338 


Db 


1022 


HI 1 =1 IIIIIIII! II III II •■!= h II 1 h Mill II 

SGGGAGAKGSAGAACCGGAGCCCGACCCTRASCCR- - -ACTAYACCCCAAGRAGTCACCC 


966 


Qy 


339 


GCGTGCTAATGGTGGAAAGCGGO^CCAAATCTATGATAAATTCAAGGGCACCCCCCACA 

Illlhllllhl MM 1 MM IIIMIMM II MMM 1 1 MM 

GCGTGYTAATGKT-GAAACCC^CAACGAAATCTATGACA^ 


398 


Db 


965 


907 


Qy 


399 


G CTTATATATG CTGTTCAA CACGTCGG AG CT C CGGGAAG CGGTG C CGG AA C CTGTATTG C 


458 


Db 


906 


M IIIIIIII 1 IIIIIIII 1 1 IIIIIIII MMIMI II Mill M MM 

GCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTGC 


847 


Qy 


459 


TCTCTC-GGG CAG AG C TG CG C CTG CTG AGG CT CAAG TTAAAAGTGG AG CAG CA CG TGGAG 


517 


Db 


846 


MM 1 IMMIMMMI 1 1 1 M M M M 1 1 1 M 1 1 M II II 1 1 1 1 M 1 M II M M 

TCTCCCSGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 


787 



Qy 



518 CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 577 



ii 1 1 1 1 1 1 1 1 : 1 1 ! 1 1 1 iiiiiiiiii iiiiiiiiiiiiiiiiiiiiiii in 

Db 7 86 CTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGQ^CCC 727 

Qy 578 AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 637 

II Mill II MINI I 1 1 IIIIIIIIIIIMIIIIIIIIIIIIIIIM III I 

Db 72 6 AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 667 

Qy 63 8 CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 697 

II III I I II II II I I II II M II Mlllll I I I I llllllll III 
Db 666 CGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGAT 608 

Qy 6 98 AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 757 

llllllll II Mill II IMMMIII I I llllllll ! MM 1 : 1 1 1 1 1 II I 

Db 6 07 AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 54 8 

Qy 7 58 ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 817 

Mill IIIIIIIIIMIII llllllll MMMMIMIIMMMMMMIMM 

Db 54 7 ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 4 8 8 

Qy 818 CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGAC 8 77 

M Mill II 1 1 1 1 1 II 1 1 1 M II M 1 1 1 1 1 1 II 

Db 487 CATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGAC 452 

Qy 8 78 GACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 93 7 

MMMM 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 i 1 1 1 1 1 M I M I 

Db 4 51 ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGG 4 07 

Qy 93 8 CAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGC 9 97 

Mill 1 1 i 1 1 1 1 1 M F 1 1 1 MMMM MIIMIIIIIMI II II IIIMIMI 

Db 4 06 CAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGC 347 

Qy 9 98 TACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTAC 1057 

IIMMIMM MMMM llllllll MMMM llllllll Mill MMM 

Db 34 6 TACCATGGCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAC 287 

Qy 1058 AGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGGCGTGCTGC 1117 

M II II II II M II IIIIIIIIIMIII Mill Mill II II M M M II M Ml 

Db 2 86 AGCAAGGTCCTGGCCCTGTACAACCAGCATAACCC - GGCGCCTCGGCGGCGCGGTG - TGC 229 

Qy 1118 GTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 1177 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! i , 1 1 1 1 MIIIIIIIIIMIIMIIIIIIIIMMIMIMMM 

Db 228 GTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTG 169 

Qy 1178 GAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCC 1237 

llllllllllllllllllllllllll IIMIIIIIIilllllllll 1 1 1 1 M 1 1 M 1 1 

Db 168 GAGCAGCTGTCCAACATGATCGTGCGCTCGTGCAAGTGCAGCTGAGGTCCCGCCCCGCCC 109 
Qy 123 8 ACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCT CA CCGGG 128 9 

llllllll Mill MM MM Ml M II Ml 

Db 108 CGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCCCGCCCCCGCTGCCTTGCCCATGGGG 4 9 

Qy 1290 GCTGTATTTAAGGACATGGTGCCCCAAGCCCACTTGGGATCGATTAAA 1337 

M 1 1 1 1 1 1 1 1 1 M 1 1 1 I II I II II M I M MM I : II 

Db 4 8 GCTGTATTTAAGGACACCCGTCCCCAAGGCCACCTGGSCACCCAYTAA 1 



RESULT 



6 



BX434425/C 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX434425 888 bp mRNA linear EST 15-MAY-2003 

BX434425 Homo sapiens PLACENTA Homo sapiens cDNA clone CS0DE014YE16 
3-PRIME, mRNA sequence. 
BX434425 

BX4 3442 5. 1 GI : 3 077 92 91 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria ; Primates; Catarrhini; Hommidae; Homo. 
1 (bases 1 to 888) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi ?seq=CSOBAK028AB08NMl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.mvitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAK028AB08NM1 . 

Location/Qualifiers 

1. .888 

/organism="Homo sapiens" 
/mol_type= " mRNA " 
/db_xref = H taxon: 9606" 
/clone="CS0DE014YE16" 
/ 1 i s sue_type = " PLACENTA " 
/clone_lib="Homo sapiens PLACENTA" 

/note=" Vector: pCMVSP0RT_6; 1st strand cDNA was primed 
with a NotI -oligo (dT) primer. Five prime end enriched, 
double-strand cDNA was digested with Not I and cloned into 
the Not I and EcoRV sites of the pCMVSPORT 6 vector. 
Library was not normal ized . " 
156 a 253 c 302 g 176 t 1 others 



Query Match 45.9%; Score 619.2; DB 13; Length 888; 

Best Local Similarity 83.4%; Pred. No. 2.1e-124; 

Matches 770; Conservative 0; Mismatches 104; Indels 49; Gaps 



Qy 

Db 

Qy 
Db 



428 CTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGG 4 8 7 

MM MMMI M Mill I III I IIIIMIMIMM lllllll I 

888 CTCCAAAAAGCGGTACCTGAACCCTTGTTGTCTNCCCGGGCAGAGCTGCGTCTGCTGA-G 83 0 

4 88 CTCAAGTTAAAAGTGGAGOVGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 54 7 

M I M II I III 1 1 1 1 1 f I f 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM 

829 CTCAAGTTTAAAATGGAGCAGCACGTGGAGCTGTACCAGAAATA 770 



Qy 



54 8 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 607 

II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I III Mill M MUM I II MUM 



Db 


769 


CGATACCTCAGCAACCGGCTGCTGGCACACAGCGACTCGCCAGAGTGGTTATCTTTTGAT 


710 


Qy 


608 


GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 


667 


Db 


709 


IIIMIIIMMIMIIMIMII III III Ml 1 1 II Mill MINIM 

GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 


650 


Qy 


668 


AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 

II lllllll MM MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 IMM II 1 1 1 M 1 1 1 1 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 


727 


Db 


649 


590 


Qy 

Db 


728 
589 


AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 

1 1 1 MMMM 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 MMMMMMM MMMM 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 


787 
530 


Qy 


788 


CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 


847 


Db 


529 


MMIMIMMMIMMMMMMIIIM IMM II M 1 1 M II M 1 1 1! 1 1 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGA - - - 


473 


Qy 


848 


CTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTC 

MMMM MMMM MUM 

GCCCTGGACACCAACTATTGCTTC 


907 


Db 


472 


449 


Qy 


908 


AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTG 

1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 r 1 1 1 f 1 1 1 1 1 1 1 1 ! IMIMIIIIMM MMMM 

AGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTC 


967 


Db 


448 


389 


Qy 


968 


GGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGT 


1027 


Db 


388 


MMMMMMM M M 1 1 1 II 1 1 1 1 1 II II 1 II 1 II MMMM MMMM 

GGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGC 


329 


Qy 


1028 


CCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCAC 

MMMM MMMM Mill II 1 1 1 1 1 II II II 1 1 II 1 II MMMMMMM 

CCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCAT 


1087 


Db 


328 


269 


Qy 
Db 


1088 
268 


AACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATC 

MIMIMIM IIIMMMIIIIIIIIIIIIIIIIMIIIIIIIMM MIIIIMI 
AACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATC 


1147 
209 


Qy 


1148 


GTGTACTACGTGGGCCGCAAGCGCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCC 

1 1 M II 1 1 II M M M M 1 1 II 1 1 1 1 II M 1 1 1 1 1 1 1 II M 1 M M II M M 1 II 1 Ml 

GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGGTCC 


1207 


Db 


208 


149 


Qy 
Db 


1208 
148 


TGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCC 

1 1 II II M M M II M 1 1 1 1 f 1 1 1 1 1 1 1 1 II 1 II 1 1 1 Mill Ml 

TGCAAGTGCAGCTGAGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCACCC 


1267 
89 


Qy 


1268 


CACCCCCGCCCGCCT CACCGGGGCTGTATTTAAGGACA - TCGTGCCCCAAGG 

1 MM Ml II M MMMMMMMMMI MMIMMMI 

CGCCCCGCCCCGGCTGCCTTGCCCATGGGGGCTGTATTTAAGGACACCCGTGCCCCAAGC 


1318 


Db 


88 


29 


Qy 


1319 


GCACTTGGGATGGATTAAAGCGG 134 1 

MM MM 1 IMIMI M 

CCACCTGGGCCCCATTAAAGAGG 6 




Db 


28 





RESULT 7 
BM555996 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BM555996 1043 bp mRNA linear EST 20-FEB-2002 

AGENCOURT_6544437 NIH_MGC__88 Homo sapiens cDNA clone IMAGE : 555003 9 
5', mRNA sequence. 
BM555996 

BM555 996. 1 GI : 187 96907 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1043) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

ht tp : / / image . 1 lnl . gov 

Plate: LLAM12261 row: i column: 16 

High quality sequence stop: 702. 
Locat ion/Qua 1 if iers 
1. .1043 

/ organ ism=" Homo sapiens" 
/mol_type="mRNA n 
/db_xref ="taxon: 96 06" 
/clone=" IMAGE: 5550039" 

/tissue_type="duodenal adenocarcinoma, cell line" 
/lab_host="DH10B (phage- resistant ) " 
/clone_lib="NIH_MGC_88 M 

/note=" Organ: small intestine; Vector: pCMV-SPORT6; 
Site_l: Not I; Site_2: Sail; Cloned unidirectionally ; 
oligo-dT primed. Average insert size 1.767 kb. Library 
enriched for full-length clones and constructed by Life 
Technologies. Note: this is a NIH_MGC Library." 
202 a 329 c 345 g 154 t 13 others 



Query Match 45.9%; 
Best Local Similarity 82.0%; 
Matches 734; Conservative 



Score 619; DB 12; Length 1043; 
Pred. No. 2.5e-124; 
0; Mismatches 122; Indels 39; 



Gaps 



1; 



Qy 

Db 

Qy 
Db 

Qy 
Db 



3 96 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 4 5 5 

Mill II III III I I III Mil II II MUM Ml M Ml M MM! !! I 

1 AC^GCATATATATGTTCTTC^ACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 60 

4 56 TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 515 

IMMM 1 1 1 ! 1 1 1 1 1 1 1 1 1 i 1 1 (( 1 1 1 1 1 1 1 1 i I II 1 1 1 1 ! 1 1 1 1 1 1 1 f I ! 1 1 1 1 1 1 

61 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 12 0 
516 AG CTATA C CAGAAATA CAG CAATGA TTCCTGG CG CTA C CT CAG CAACCGG CTG CTGG CC C 575 

MM MMMMMMMMI IMMMMI M II 1 1 1 II 1 1 M M II M II II I 

121 AGCTGTAC(^GAAATACAG(^CAATTCCTGGCGATACCTC^GCAACCGGCTGCTGGCAC 18 0 



Qy 


576 


CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
i i i i i i i i i ii i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i iii 

C CAG CGACT CG CCAGAGTGGTTAT CTTTTG ATGT CA C CGGAGTTGTG CGG CAGTGGTTGA 


635 


Db 


181 


240 


Qy 


636 


CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAG 


695 


Db 


241 


III Mill M 1 1 Ml II 1 M 1 1 1 1 1 II 1 M II I 1 II II 1 1 1 II 1 1 
GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCGTGTGACAGCAGGG 


300 


Qy 


696 


ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 

1 1 1 II II 1 II M 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 llllllll 1 1 II 1 1 II II 1 1 1 
i i i i i i i i i i ii i i i i i ii i i i i i i i i i i i i iiiiiiii i i i i i i i i i i i i i 

ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 


755 


Db 


301 


360 


Qy 


756 


CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGGTGGAGAGGGCCC 

1 II 1 1 1 1 lllllllllllill llllllll II II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
iiiiiii iiiiiiiiiiiiii iiiiiiii iiiiiiiiiiiiiiiiiiiiiiiiiiii 

CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 


815 


Db 


361 


420 


Qy 


816 


AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACGAGTACAAGGATGACG 


875 


Db 


421 


II 1 1 Mill 1 1 M M M M M M M 1 1 
1 1 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

AGCATCTGCAAAGCTCCCGGCACGGCCGA 


449 


Qy 


876 


ACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 


935 


Db 


450 


1 1 1 M 1 1 1 1 1 II 1 II 1 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 II II II II 
GCCGTGGACACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGC 


501 


Qy 


936 


GGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGG 


995 


Db 


502 


1 1 II 1 II II 1 II 1 1 1 II 1 II 1 II II 1 1 1 1 II II 1 1 1 1 II II II II II 1 1 II II 1 
iiiiiii iiiiiiiiiiiiii iiiiiiii iiiiiiiiiiiiii ii ii iiiiiii 

GGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGG 


561 


Qy 


996 


GCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCGTAGACACTCAGT 

II M II II II M 1 1 II II M 1 M M M 1 1 II M M II llllllll Mill MM 


1055 


Db 


562 


■ i liiiiiiiiii iiiiiiii i iiiiiii iiiiiiii iiiiiiii 1 1 1 1 i 1 1 1 1 

GCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGGCTGGACACGCAGT 


621 


Qy 


1056 


ACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGGCGTGCT 


1115 


Db 


622 


1 1 1 II II 1 1 II II II 1 1 II II 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 II II II II 1 II II 1 II 1 1 

llllllllllllllll lllllllllllill lllllllllll llllllllllllllll 

ACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCT 


681 


Qy 


1116 


GCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 


1175 


Db 


682 


MMIIIMIMMIMIMM MMMMIIIIMIMMMMMIIIIIMIMM 

GCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGG 


741 


Qy 


1176 


TGGAGCAGCTGT C CAACATGAT CGTG CGTT C CTG CAAGTG CAG CTGAGG CCCCGCCCCGC 


1235 


Db 


742 


IIIIIMMIMMIIIIIIMIIIMI IIIIIII Ml 1 1 II 1 

TGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAGTGCACCTTGAAGGGTCCGGCCCC 


801 


Qy 


1236 


CCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCCCGCCTCACCGGGG 12 90 




Db 


802 


1 II II Ml 1 II Ml Ml 1 M 

CACNCGGGNCNNNNNGGNNNNGGGGGGNCCCCGGCGCCGGGCCCCCGCGGCCCGG ft S6 





RESULT 8 
BX383773/C 

LOCUS BX383773 950 bp mRNA linear EST 08-MAY-2003 

DEFINITION BX383773 Homo sapiens HELA CELLS COT 2 5 -NORMALIZED Homo sapiens 

cDNA clone CS0DK001YA15 3-PRIME, mRNA sequence. 

ACCESSION BX383773 

VERSION BX383773.1 GI: 30457168 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 950) 

Li,W.B., Gruber,C. , Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi -bin/ cluster . cgi?seq=CS0DK001AA08NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.mvitrogen.com/ inVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODK001AA08NP1 . 

Location/Qualifiers 

1. .950 

/organism="Homo sapiens" 
/mol_type="mRNA M 
/db_xref ="taxon: 96 06" 
/clones" CS0DK001YA15" 

/cell_type="HELA CELLS COT 2 5 -NORMALIZED" 
/cell_line="HELA" 

/clone_lib="Homo sapiens HELA CELLS COT 25 -NORMALI ZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
171 a 260 c 312 g 195 t 12 others 



Query Match 4 5.4%; Score 612; DB 13; Length 950; 

Best Local Similarity 80.2%; Pred. No. 8e-123; 

Matches 78 9; Conservative 8; Mismatches 136; Indels 51; 



Gaps 



5; 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 



353 G AAAG CGG CAA CCAAAT CT ATG ATAAATT CAAGGG CA C C C C C CA CAG CTTAT AT ATG CTG 412 

hll I 1 1 1 1 MINIUM 1 1 MMII I : I IMMI IIMIMI I 

943 GRAACCCACAACGAAATCTATGACAAGTTCAAGCA^ 8 84 

413 TTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAG 4 72 

II I II : II 1 1 IIMIMI MM MM 1 1 I M II 1 1 MUM 1 1 MM Mill 

883 TTCAAMACATCAGAGCTCCGAGAAGCGGTACCTGVACCCGTGTTGCTCTCCCnnnCA.GAG 824 

4 73 CTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATAC 532 

Mill 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 . 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 ! , ! 1 1 1 1 , 1 1 1 1 1 1 1 1 1 M I 

823 CTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC 764 

533 AGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAG 592 

Mill II -WWW 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 Mill Mill II 1 1 1 

7 63 AGCAACAATAMCTGGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAG 704 



Qy 5 93 TGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATA 652 

III I II M 1 1 M M 1 1 1 1 M 1 1 1 M M M M 1 1 1 II Ml III Ml M II 

Db 7 03 TGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATT 644 

Qy 653 GAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTG 712 

M II IMMMI II lllllll MM I MUM III MMMI II Ml 

Db 643 GA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTTACAGCAGGGATTACACACTGCAAGTG 585 

Qy 713 GAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAC 77 2 

II M IMMMI I I I Illlllll 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 MUM 

Db 584 GACATCAACGGGTTTACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAAC 525 

Qy 773 CGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCC 832 

Mill MMMM M II II I M M M M M M II M M II M M I Mill MUM 

Db 524 CGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCC 4 65 

Qy 833 CGGCACCGCCGAGCCCTGGATACCAACGACTACAAGGATGACGACGACAAGGCCCTGGAT 8 92 

] 1 1 1 1 1 1 1 1 1 1 1 MMMM 

Db 4 64 CGGCACCGCCGA GCMCTGGAC 4 44 

Qy 8 93 AC CAACT ACTG CTT CAG CT C CACGG AG AAG AACTG CTG CGTG CGG CAG CT CTACATTG AC 9 52 

MMMM MIIMMMIMMMIIMMMMMMMMIIMM IMIIIMI 

Db 443 ACCAACTATTGCTTCAGCTCCACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTKAC 384 

Qy 953 TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 1012 

Mill MMMM MIMIMMMM II II MMMIMMMIMM I Ml 

Db 3 83 TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCGACTTC 324 

Qy 1013 TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 1072 

Mill IMMMI IMMMI MMMM Mill II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

Db 323 TGCCTCGGGCCCTGCCCCTA(^TTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCG 264 

Qy 1073 CTGTACAACCAGCACAACCCGGGCGCGTCGG CGG CGC CGTG CTGCGTGCCGCAGGCG CTG 1132 

I II III Mill I J J 1 1 1 J 1 1 1 1 1 f I IMIIIIIIIIIMMII 

Db 263 CCTGTACAACCAGCATAACCCGGCGCCTCGGCGGCGCCGTG- TGCGTGCCGCAGGCGCTG 2 05 

Qy 1133 GAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 1192 

mil 1 1 1 ; 1 1 - 1 1 : ; ■ 1 1 1 1 : 1 1 ! 1 1 1 1 mmmmmmimimmmimi m 

Db 2 04 GAGCCGCTGCCC^TCGTGTACTACGTGGGCKKC^AGCCCAAGGTGGAGCAGCTGTCCNAC 14 5 

Qy 1193 ATGATCGTGCGTTCCTGCAAGTGCAGCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACC 1252 

IMMMMM I M 1 1 1 1 1 II 1 1 II I II II IMMMMIM M I M 

Db 144 ATGATCGTGCGCTCCTGCAAGTGCAGCTGAGATCCCGCCCCGCCCCGCCCCGCCCCGGAA 8 5 
Qy 1253 CGGCAGGCCCGGCCCCACCCCCGCCCGC CTCACCGGGGCTGTATTTAAGGA 13 03 

II I II Ml MMMM I I M II M M I M II I II I I 

Db 84 GGCCGCGGCCCCACCCCGCCCCGCCCCCSCTGCCTTGCCCATGGGGGCTGTATTTAAGNA 2 5 

Qy 13 04 - CATCGTGCCCCAAGCCCACTTGG 132 6 

I I II MIMIIMM Ml 
Db 24 CCCCCCTGGCCCAAGCCCACCTGG 1 



RESULT 9 
AL530081 

LOCUS AL530081 859 bp mRNA linear EST 23-MAY-2003 

DEFINITION AL530081 Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALI Z ED Homo sapiens 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



cDNA clone CS0DD009YM06 5-PRIME, mRNA sequence 
AL530081 

AL530081.2 GI : 31067916 
EST . 

Homo sapiens (human) 
Homo sapiens 

Chordata; 
Primates ; 



Eukaryota ; Metazoa ; 
Mammalia; Eutheria; 
1 (bases 1 to 859) 
Li,W.B. , Gruber,C. , 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



and Polayes,D. 



BASE COUNT 
ORIGIN 



Jessee, J. 

Full-length cDNA libraries and normalization 
Unpublished 

On Feb 13, 2001 this sequence version replaced gi: 12793574. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http : / / www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CS0DD009BG03QPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODD009BG03QP1 . 

Loca t i on / Qua 1 i f i er s 

1. .859 

/organism="Homo sapiens" 
/mol_type= " mRNA 11 
/db_xref ="taxon: 9606" 
/clone= " CSODDO 0 9YM06 " 

/ tissue__type=" NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 272 c 257 g 148 t 3 others 



Query Match 42.8%; Score 577.4; DB 9 

Best Local Similarity 87.0%; Pred. No. 2.6e-115 
Matches 643; Conservative 2; Mismatches 93 



Length 85 9; 
Indels 1; 



Gaps 



Qy 

Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III 1 1 1 1 II I M 1 1 1 1 : ! 1 1 Ml MINIMI 1 1 1 1 1 1 : 1 1 1 1 ! 1 1 I 

121 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTAC 18 0 



Qy 

Db 

Qy 

Db 



66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

i u i m m m m " 1 1 1 1 1 1 : 1 1 1 1, Mm 1 1 1 1 1 1 1 1 1 : : : 1 1 mimiimi 

181 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 24 0 

126 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 185 

I II I II 1 1 1 M I M M 1 1 1 1 M 1 1 M 1 1 1 M MIMMMM 1 1 1 1 1 1 1 1 1 1 1 MM 

241 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 3 00 



Qy 



18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 



i : i mi 1 1 1 : 1 1 1 1 1:1 1 1 1 1 1, : imiiiiiiiiimiiiiii iiiii ii ii i 

Db 301 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 360 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 05 

I II MINIM 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 MMMM MM MMMMMMMM 

Db 3 61 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 42 0 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I II M I IMIMMMMMMMMIIMIMIMMMMMMMI I MM 

Db 4 21 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 4 8 0 

Qy 3 66 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 425 

MMIMMI II MUM I I M M II IMIIIM I llllllll II I 

Db 481 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCAT 54 0 

Qy 426 AG CT C CGGGAAG CGGTG C CGGAAC CTGTATTG CT CT CT CGGGCAGAGCTGCG C CTGCTGA 48 5 

IMIMI IMIIIM II Mill II 1 1 1 1 1 1 [ f IMIIMIIIIIII 1 1 1 II 1 1 

Db 541 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 600 

Qy 486 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

MMMMMMMIMMMMMMMIMM IMMMMMMMM MMM 

Db 6 01 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 660 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG-GCTGTCCTTT 604 

MM M 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 !l 1 1 II IIIII Mill II Mill I I II Ml 

Db 661 GGCGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGTGTTATCTTTT 72 0 

Qy 605 GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGC 664 

IIIIIIIIIIIIIIIIIMIIIIIMI III III Ml II II Mill MMM 

Db 721 GATGTCACCGGAGTTGTGCGGCAGTKGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGC 78 0 

Qy 665 CTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGG 724 

M II MMIM MM IMIIIM IMIMMIM M Mhl M MMM 

Db 781 CTTAGCGCCCACTGCTCCTGTGAC^GC^GGGATAAC^CACTGCAAGTGRACATC^ACGGG 84 0 

Qy 725 TTCAATTCTGGCCGCCGGG 743 

MM I I MMMM I 

Db 841 TTCACTACCGGCCGCCGAG 859 



RESULT 10 
BE312000 

LOCUS BE312000 713 bp mRNA linear EST 26-OCT-2000 

DEFINITION 601154768F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 3510592 5 ! , 

mRNA sequence. 
ACCESSION BE312000 
VERSION BE312000.1 GI: 9130128 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 713) 

AUTHORS NIH-MGC http://mgcnci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 
JOURNAL Unpubl i shed 
COMMENT Contact: Robert Strausberg, Ph.D. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM195 row: h column: 17 
High quality sequence start: 2 
High quality sequence stop: 713. 

Locat i on/Qua 1 i f lers 

1. .713 

/organism* 11 Homo sapiens'* 

/mol_type=" mRNA " 

/db_xref ="taxon: 9606" 

/ clone=" IMAGE -.3510592" 

/t is sue_type=" neuroblastoma 11 

/lab_host="DH10B (phage-resistant ) " 

/ c 1 one_l ib= " N I H_MGC_1 9 " 

/note= "Organ: brain; Vector: pOTB7; Site_l: Xhol ; Site_2: 
EcoRI; cDNA made by oligo-dT priming. Direct ionally 
cloned into EcoRl/XhoI sites using the following 5' 
adaptor: GGCACGAG(G) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 
157 a 226 c 211 g 119 t 



Query Match 42.3%; Score 570.6; DB 10; Length 713; 

Best Local Similarity 87.5%; Pred. No. 7.4e-114; 

Matches 624; Conservative 0; Mismatches 89; Indels 0; Gaps 



Qy 
Db 



13 5 AGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCC 194 

llllllllllllllllllllll lllllllllll lllllllllll Mill lllllll 

1 AGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCC 60 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



195 CCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACA 254 

IIIIIIIIIIIIIIII M I ( 1 1 M 1 ! M I M I M I ! Mill II M M II MM 

6 1 CCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACA 12 0 
255 ACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGG 314 

MM MIMIMMIMI MMMM MM 1 1 II 1 1 1 1 1 II 1 1 1 II I Mill I 

121 A CAG CAC C CG CGAC CGGGTGG C CGGGG AGAGTG CAG AAC C GG AG C C CG AG C CTGAGG C CG 18 0 
315 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATG 374 

I I M M M II I M I M M II M M M M M M M II M I M I I MM I M M 



i i i i i i 



181 ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATG 24 0 

3 75 ATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGG 434 

I II MMM I I MUM MMMM I MMMM II MMMM I 

241 ACAAGTTCAAGCAGAGTACACAOVGC^TATATATGTTCTTC^CACATCAGAGCTCCGAG 3 00 



Qy 435 AAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGT 4 94 

lllllll M Mill II MMMM MIIMMMIMI i Mill! 1 1 1 M 1 1 1 1 

Db 3 01 AAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGT 360 



Qy 4 95 TAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACC 554 

IIIIIIIIMIIIIIIIIIIIIII Illllllllllllllll 1 1 1 1 1 M 1 1 1 MM 

Db 3 61 TAAAAGTGGAG CAG CA CGTGGAG CTGT AC CAGAAATACAG CAA CAATTC CTGG CGATA C C 420 



Qy 555 TCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCG 614 

1 1 1 1 1 1 1 1 1 f 1 1 f 1 1 1 1 1 1 Mill lllll II llllll I II I i , I . i 

Db 421 TCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCG 480 

Qy 615 GAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCC 674 

IIIMIIIIIIIMIM III III III II II lllll lllllll M MM 

Db 4 81 GAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCC 54 0 

Qy 675 ACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTG 734 

III MM IIIMIII lllllllllll II Mill II llllllllll I I I 

Db 541 ACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCG 600 

Qy 735 GCCGCCGGGGTGACCTGGCGACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGG 7 94 

iiiiiii 1 1 ; 1 1 : i 1 1 1 , , 1 1 1 1 : . 1 1 immmimmm iimmm mmmi 

Db 601 GCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGG 660 

Qy 795 CCACC C CG CTGG AG AGGG CC CAG CACCTG CACAGCT C CCGG CAC CG C CG AG C C 847 

IIIIIIIIMIIIMIIIIIIIIII lllll lllllll I I MMM 
Db 661 CCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCCGGCACCGCGAGCC 713 



RESULT 11 

BE260971 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BE260971 717 bp mRNA linear EST 26-OCT-2000 

601153715F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 3509931 5\ 
mRNA sequence. 
BE260971 

BE260971 .1 GI : 9132709 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 717) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpubl ished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DMA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM193 row: m column: 04 
High quality sequence stop: 713. 
Locat ion/Qua 1 if iers 
1. .717 

/organ ism=" Homo sapiens" 

/ mo 1 _t ype = " mRNA " 

/db xref ="taxon: 9606" 



/clone= M IMAGE: 350 9931" 

/ 1 i s sue_type = " neurobl a s t oma " 

/lab_host="DHlOB (phage-resistant ) " 

/clone_lib="NIH_MGC_19" 

/note="Organ: brain; Vector: pOTB7; Site_l: Xhol ; Site_2 : 
EcoRI; cDNA made by oligo-dT priming. Direct ional ly 
cloned into EcoRI /Xhol sites using the following 5' 
adaptor: GGCACGAG(G) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 

BASE COUNT 158 a 226 c 213 g 120 t 

ORIGIN 



Query Match 41.7%; Score 562.6; DB 10; Length 717; 

Best Local Similarity 87.4%; Pred. No. 4.1e-112; 

Matches 627; Conservative 0; Mismatches 89; Indels 1; Gaps 1; 



Qy 


136 


GCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCC 

1 II M II 1 1 II 1 II M M 1 II 1 1 1 II 1 1 II 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 MINIM 


195 


Db 


1 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11,11 1 1 1 1 1 1 1 1 
GCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCC 


60 


Qy 


196 


CCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAA 


255 


Db 


61 


lllllllllll III IIIIIIIMIIMIIIIIII INN N II 1 1 II III 

CCCGAGCCAGG-GGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAA 


119 


Qy 


256 


CAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGA 


315 


Db 


120 


III 1 1 1 1 1 II 1 1 1 1 1 1 1 NIMIII MM 1 1 1 II 1 II II 1 1 1 1 II 1 Mill II 
CAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGA 


179 


Qy 


316 


CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGA 


375 


Db 


180 


MIMIMMIIMMIMIIMMIMMIIIMIIIIM 1 MM MIMMIM 

CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGA 


239 


Qy 


376 


TAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTa\ACACGTCGGAGCTCCGGGA 


435 


Db 


240 


M MINI 1 1 MUM MMMM 1 MMMM II II 1 1 II II II 

CAAGTTCAAGCAGAGTAC^CACAGCATATATATGTTCT 


299 


Qy 


436 


AGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTT 

MUM M Mill 1 1 MMMM 1 1 1 f 1 J 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 II 1 

AGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTT 


495 


Db 


300 


359 


Qy 


496 


AAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCT 

MIMMMMMMMIIMIM 1 1 1 1 ( I ! II 1 1 1 i 1 1 II MIMMIM Mill 

AAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCT 


555 


Db 


360 


419 


Qy 


556 


CAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATnTCACCGG 


615 


Db 


420 


IMMIMMIMIIMI INN Mill II MINI 1 II IMIMMIIMM 

C^GO^CCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGG 


479 


Qy 


616 


AGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCA 

IMMMMMIMM III Ml III II 1 1 Mill MMMM II Mill 

AGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCA 


675 


Db 


480 


539 


Qy 


676 


CTCTTCCTCTGA(^G(^^GATAA(^(^CTCavCGTGGAAATTAACGGGTTCAATTCTGG 


735 



II 1 1 1 1 MINIM lllllllllll II Mill II MIMMIM I I II 



Db 



54 0 CTGCTCCTGTGAC^GC^GGGATAA(^CACTGC^GTGGA(^TaACGGGTTCACTACCGG 5 9 9 



Qy 736 CCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGC 7 95 

Mill! IIIIIIIIIIIIIIIIIIII IIIMIIIIIIIII IIIIIIII IIIIIIII 

Db 600 CCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGC 659 

Qy 7 96 CACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGA 852 

IIIIIIIIIIIIIIIMIIMIII Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 M II Ml 

Db 66 0 CACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCGAGCCCTGGGA 716 



RESULT 12 
BQ675698 

LOCUS BQ675698 902 bp mRNA linear EST 15-JUL-2002 

DEFINITION AGENCOURT_8 036532 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 62 1 1917 

5 1 , mRNA sequence . 
ACCESSION BQ675698 

VERSION BQ675698.1 GI:21786532 

KEYWORDS EST . 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 902) 

AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 

TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 

JOURNAL Unpubl ished 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih .gov 
Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http : //image . llnl . gov 
Plate: LLCM2375 row: c column: 22 
High quality sequence stop: 599. 
FEATURES Location/Qualifiers 
source 1. .902 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 96 06" 
/clone=" IMAGE: 6211917" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/clone_l ib= "NIH_MGC_1 02 " 

/note="Organ: salivary qland; Vector: pOTB7; Site__l : xhoT ; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5» adaptor: GGCACGAG (G) . Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript 1 1 RT (Life 
Technologies). Note: this is a NIH_MGC Library." 

BASE COUNT 194 a 292 c 255 g 161 t 

ORIGIN 



Query Match 41.6%; Score 561.6; DB 13; Length 902; 

Best Local Similarity 87.4%; Pred. No. 7.3e-112 ; 

Matches 615; Conservative 0; Mismatches 89; Indels 0; Gaps 0; 
Qy 156 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 215 

I IIIIMIIIII IIIMIIMII Mill IIMIIIMIIIIIMIIIIIII 1 1 II 

Db 1 T C CG CGG C CAG ATC CTGT C CAAG CTG CGG CTCG C CAG C C C C CCG AG C CAGGGGGAGGTG C 60 

Qy 216 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 275 

MIMMMMMIM Mill II II II II MINIM MMIMMIMM I 

Db 61 CGCCCGGC CCG CTG CCCGAGGCCGTG CTCG CC CTG TACAA CAG CACCCGCGACCGGGTGG 12 0 

Qy 2 76 CCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCA 33 5 

1 1 1 1 1 1 1 MM 1 1 f 1 1 1 1 1 1 r 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 121 CCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCA 180 

Qy 33 6 CCCGCGTGCTAATGGTGGAAAGCGGC^CCAAATCTATGATAAATTCAAGGGCACCCCCC 3 95 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM IMMIIMI II Mi 1 I I I 

Db 181 CCCGCGTGCTAATGGTGGAAACC(^(^CGAAATCTATGACAAGTTC^GaVGAGTACAC 24 0 

Qy 3 96 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 455 

Mill IIIIIMI I MMMM II MMMM IIMIMI II Mill II I 

Db 241 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 3 00 

Qy 4 56 TG CT CTCT CGGG CAG AG CTG CG C CTG CTGAG G CTCAAGTTAAAAGTGGAG CAG CA CGTGG 515 

1 1 1 1 1 1 1 MMIMMIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 N 1 1 1 

Db 3 01 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCA CGTGG 360 

Qy 516 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 575 

MM MIIIIIIIIIIMIII 1 1 II II M II 1 1 1 ! 1 i 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 I 

Db 361 AGCTGTACCAGAAATACA.GCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCAC 420 

Qy 576 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 63 5 

MM Mill M MUM I II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 Ml 

Db 421 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 48 0 

Qy 636 CCCGCAGAGAGG CTATAGAGGGTTTTCG CCTCAGTGCC CACTCTTC CT CTGACAG CAAAG 695 

Ml III M II Mill MMMM II lllllll MM MMMM I 

Db 481 GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 54 0 

Qy 696 ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 755 

IMMIIMI II Mill M IMIIIMII I I MMMM IIIIIIMIMM 

Db 541 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 600 

Qy 756 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 815 

lllllll MMIMMIMM MMMM I II I II II M II 1 1 1 II 1 1 M II M I II 

Db 601 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 66 0 

Qy 816 AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAAC 859 

MM MM MMIIIMIIIIMIIIIIIIMI I II 

Db 661 AGCATCTGCCAAGCTCCCGGCACCGCCGAGCCCTGGGACACCAC 704 



RESULT 13 
BI818841 

LOCUS BI818841 



925 bp mRNA linear EST 04-OCT-2001 



DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



603037307F1 NIH_MGC_115 Homo sapiens cDNA clone IMAGE : 5178433 5' 
mRNA sequence . 
BI818841 

BI818841 . 1 GI : 15930391 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 925) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLAM11444 row: n column: 02 
High quality sequence stop: 874. 
Location/Qualifiers 
1. .925 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 96 06" 
/clone=" IMAGE: 5178433" 
/lab_host="DH10B" 
/clone_lib="NIH_MGC_115" 

/note="0rgan: pooled brain, lung, testis; Vector: 
pCMV-SP0RT6; Site_l: Not I; Site_2 : EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-27; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and directionally cloned (EcoRV site is 
destroyed upon cloning). Average insert size 1.8 kb, 
insert size range 1-3 kb. Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 
021. Note: this is a NIH__MGC Library." 
170 a 316 c 278 g 161 t 



Query Match 3 9.6%; Score 534.4; DB 12; Length 92 5; 

Best Local Similarity 87.2%; Pred. No. 6e-106; 

Matches 609; Conservative 0; Mismatches 86; Indels 3; Gaps 



2 : 



Qy 

Db 

Qy 
Db 



6 CCGAGATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGC 6 5 

II III I I I I II lllllllllll lllllllllllll llllllllllllll I 
146 CCCCCATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTAC 2 05 

66 TAGTGCTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGG 125 

I 1 1 1 1 1 1 1 M 1 1 1 1 M f I II 1 1! M 1 1 Mill llllllllllllll llllllllll 

2 06 TGGTGCTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGG 265 



Qy 12 6 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGC 18 5 

1 1 1 1 1 1 1 1 . MM I II 1 1 1 1 M 1 1 1 ; 1 1 . MMMMIM MMMMMI MM 

Db 266 AGCTGGTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGC 32 5 

Qy 18 6 TTGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGG 24 5 

I MMMMMMMMMMMI MMMMMIMMMMI MMI M 1 1 I 

Db 326 TCGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCG 38 5 

Qy 24 6 CTCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGC 3 0 5 

I 1 1 MMIMI MMMMMMM MMMM MM MMMMMMIMI 

Db 38 6 CCCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGC 44 5 

Qy 3 06 CAGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACC 365 

I IMM MMMMMMMMMMMMMMMMMMMMMI I IMI 

Db 44 6 CTGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACG 5 05 

Qy 366 AAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGG 42 5 

MMIMMI M MMM I I MMM MMMM I MMMM M I 

Db 5 06 AAATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAG 565 

Qy 426 AGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGA 485 

lllllll MINIM II Mill II MMMM MMMMMMM lllllll 

Db 566 AGCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGA 62 5 

Qy 4 8 6 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCT 54 5 

IMMIMMMMMMMMMMMMIMI MMMMMMMMI MMM 

Db 626 GGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCT 68 5 

Qy 54 6 GGCGCTACCTCAGCAACCGGCTGC - TGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTT 604 

MM MMMMMMMMMI MM MMI MMI M MMM I M Ml 

Db 68 6 GGCGATACCTCAGCAACCGGCTGCTTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTT 74 5 

Qy 605 GATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGA - -GAGGGTATAGAGGGTTTTC 662 

IMIMMMMMMMMIMIMI III III II I I II MMI IMI 

Db 74 6 GATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGACGGGGAACATTGAGGGCTTTC 8 05 

Qy 663 GCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAAC 70 0 

MM II lllllll MM MMMM MM I 

Db 8 06 GCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATACC 843 



RESULT 14 

AL530080/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AL530080 841 bp mRNA linear EST 23-MAY-2003 

AL530080 Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED Homo sapiens 
cDNA clone CS0DD009YM06 3 -PRIME, mRNA sequence. 
AL53 008 0 

AL53 0 080 .2 GI : 31067915 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Euthena,- Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 841) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 



COMMENT On Feb 13, 2001 this sequence version replaced gi: 12793573. 

Contact: Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http: //www. genoscope. ens . fr/ 

cgi -bin/cluster . cgi?seq=CSODD009BG03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODD009BG03NP1 . 
FEATURES Locat ion/Qual i f lers 

source 1. .841 

/organ ism=" Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/clone="CS0DD0 0 9YM06" 

/tissue_type= "NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 50 -NORMAL I ZED" 
/note="lst strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 150 a 240 c 283 g 153 t 15 others 

ORIGIN 

Query Match 3 9.6%; Score 533.8; DB 9; Length 841; 

Best Local Similarity 81.2%; Pred. No. 7.9e-106; 

Matches 687; Conservative 8; Mismatches 106; Indels 45; Gaps 5; 
Qy 438 CGGTGCCGGAACCTGTATTGCTCT-CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTA 4 96 

1 1 1 1 II hill I 1 1 h 1 1 1 I 1 1 1 1 II 1 1; I III Mill MINIMUM 

Db 841 CGGTACCTGMACCCTTGTTGYTCTCCCCGGGCAGAGCTGCTTCTGCTAAGGCTCAAGTTA 782 

Qy 4 97 AAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTC 55 6 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M I II I M I M M I 

Db 781 AAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC^GCAA 722 

Qy 557 AGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGA 616 

llllllhll MUM Mill Mill II M 1 1 1 1 I II MM IMIIMIM 

Db 721 AGCAACCSGCSGCTGGCACCCAGCSACTCGCCAGAGTGGTTTTCTTTTGTTGTCACCGKA 662 

Qy 617 GTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCAC 676 

MM II 1 1 1 1 1 1 1 1 1 Ml III III II II III MIMIII II III M 

Db 661 GTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTAAGG - -TTTCGCCTTAGCGCCNAC 604 

Qy 677 TCTTCCTCTGAC^GCAAAGATAACAC^CTCa\CGTGGAAATTAACGGGTTCAATTCTGGC 736 

MM I MUM 1 1 1 1 1 1 1 1 1 1 ) II III I II MIMIII I I I III 

Db 603 GGCTCCTGTAACAGCAGGGATAACAC^CTGCAAGTGAAC 544 

Qy 737 CGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCC 7 96 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 MMIMMIMM 1 1 1 1 1 1 M IMIIIMI 

Db 54 3 CGCCGAGGTGACCTGGCCACCATTCTTGGCATGAACCGGCCTTTCCTGCTTCTCATGGCC 4 84 



Qy 



797 ACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACC 856 



Db 


483 


ACCCCGCTGGAGAGGGCCCAGCTTCTGCAAAGCTCCCGGCACCGCCGA 


436 


Qy 


857 


AACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACG 

MINIM MMMM MIMMIMIMM 

GCCCTGGACACCAACTATTG CTTCAG CTCCACG 


916 


Db 


435 


403 


Qy 
Db 


917 
402 


GAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MUM 1 1 1 1 1 1 1 MMMM MUM II 

GAGAAGAACTGCTGCGTGCGGCAGCTGTACATTAACTTCCGCAAGGACCTCGGCTGGGAG 


976 
343 


Qy 


977 


TGGATTCATGAACCCAAGGGCTACCATGCCAATTTGTGCCTGGGGCCCTGTCCCTACATC 


1036 


Db 


342 


MMI 1 1 M MMIMMhlllllMM MMMM MMMM MMMM 

TGGATCCACGAGCCCAAGGGCTWCCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATT 


283 


Qy 


1037 


TGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGC 

IMMMI Mill Ml MIMIIIIIMMII MIIIIIIIMIII MIMM 

TGGAGCCTGGACACGCAGGACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGG-- 


1096 


Db 


282 


225 


Qy 


1097 


GCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTAC 

II IMIIIIIIMIIII MIMMIMMMMMIIM MMI MIMMIMI 

GCCTCGGCGGCGCCGTGC - GCGTGCCGCAGGCGCTGGAGCCGCTGCCCTTCGTGTACTAC 


1156 


Db 


224 


166 


Qy 

Db 


1157 
165 


GTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGC 
I i I i I I I I i i i i i i i i i i i i i i ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii 

M M II M M M M II M M M M II 1 II II M M M MIMM II II M II 1 II 

GTGGGCCGCAAGCCCAAGGTGGGGCGGCTGTCCAACATGGTGGTGCGCTCCTGCAAGGGC 


1216 
106 


Qy 


1217 


AGCTGAGGCGCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGC 

MM II 1 1 1 1 f 1 1 i 1 1 1 1 MINIM Mill MM MM 1 

CGCTGGGGTCCCGCCCCGCCCCGCCCCGCCCCGGCAGGCCCGGCCCCCCCCCGCCCCGCC 


1276 


Db 


105 


46 


Qy 


1277 


CCGCCT 1282 




Db 


45 


IM II 
CCKGCT 4 0 





RESULT 15 

BI084718/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



bp mRNA 
sapiens cDNA 



linear EST 20-JUN-2001 
clone IMAGE: 5014421 3', 



BI084718 956 
602869722T1 NIH_MGC__102 Homo 
mRNA sequence. 
BI084718 

BI 084 718 . 1 GI : 14 503 04 8 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 956) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs -r@mail . nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: CLONETECH Laboratories, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 



Craniata; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort lum/LLNL at: 
http: //image . llnl .gov 
Plate: LLCM1821 row: 1 column: 06 
High quality sequence start: 3 
High quality sequence stop: 793. 
FEATURES Locat ion/Qual l f iers 

source 1. .956 

/organism="Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref ="taxon: 96 06" 
/clone=" IMAGE: 5014421" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant ) " 
/ c 1 one_l ib="NI H _MGC_1 0 2 " 

/note="Organ: salivary gland; Vector: pOTB7; Site_l: Xhol ; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG(G). Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 

BASE COUNT 172 a 2 73 c 312 g 199 t 

ORIGIN 

Query Match 39.5%; Score 533.4; DB 12; Length 956 ; 

Best Local Similarity 77.8%; Pred . No. le-105; 

Matches 755; Conservative 0; Mismatches 161; Indels 55; Gaps 7,- 
Qy 329 GAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^CC^AATCTATGATAAATTCAAGGGC 388 

Mill I Ml II I M M I II I I I II II I MM I 

Db 955 GAGGTTAACCGGTGCTAAATGGTGGAAACCCACCACGAATCTATGACCAGGTTCAGCAGA 8 96 

Qy 389 ACCCCCCACAGCTTATATATGCTGTTCAACACGTC- -GGAGCTCCGGGAAGCGGTGCCGG 44 6 

II I II Mill I Mil M I MM III MM II 

Db 895 GTACACACAGCCATAAATATGGTCTTCACACATTC^GGAGC 836 
Qy 44 7 AA C CTG T ATTG CTCTCTCGGG CAG AG CTG CG CCTG CTGAGGCTCAAGTTAAAAGT 501 

I I II Ml II I II II I I MINI 1 1 ! 1 , 1 1 1 1 1 ! 1 1 1 

Db 835 GAACCGTGTTGGCTCTTCCCGGGGAAGAGCTGCCGTCTGCCTGAGGCTCAAGTTAAAAGT 776 

Qy 5 02 -GGAGC^GC^CGTGGAGCTATACOVGAAATACAGCAATGATTCCTGGCGCTACCTCAGCA 560 

Ml I M M II II M I II Mil I MMIIIMI MMIIIMI 

Db 775 GGGAC(^GCACGTGGAGCTGTAACC^GGAATACAG(^CAATTCCTGGCGATACCTCAGCA 716 

Qy 561 ACCGGCTGCTGGCCCCCAGTGA - - CTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGT 618 

MMMM I II I Mill M II IMMI I II IIIIIIMIIMMIII 

Db 715 ACCGGCTGGTGGCACCCAGCGAACTCGCCCAGAGTGGTTATCTTTTGATGTCACCGGAGT 656 

Qy 619 TGTGCGGCAGTGGCTGACCCGCAG-AGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 677 

IIIIIIMIIIII III Ml I II II II Mill MMMM II MINI 

Db 655 TGTGCGGCAGTGGTTGAGCCGTGGAAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACT 596 

Qy 678 CTTCCTCTGACAGCAAAGATAAC^CACTCC^ 737 

MM IMIIMI II 1 1 II M I II II Mill II IMIIMIII I I MM 



Db 595 GCTCCTGTGACAGCAGGGATAAC&CACT 536 

Qy 73 8 GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 797 

1 1 II IMIMIMIIMIIIIMI llllllllllllll II M II 1 1 IIMIIIIII 

Db 535 GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 4 76 



Qy 798 CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 857 

'II 1 , 1 1 1 li . II 1 1 1 . II 1 1 1 lllll 1 1: 1 II 1 1 1 , II 1 1 1 1 1 1 1 1 , 1 1 1 1 

Db 4 75 CC C CG CTGG AG AGGG C C CAG CAT CTG CAAAGCT C C CGG CA C CG C CGAG C CCTGG A C 420 



Qy 858 ACGACTACAAGGATGACGACGACAAGGCCCTGGATACCAACTACTGCTTCAGCTCCACGG 917 

Illlllll I I I I I I I I I I ' I I I I 
Db 419 ACCAACTATTGCTTCAGCTCCACGG 3 95 

Qy 918 AGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGT 977 

I II I M M 1 1 1 II 1 1 1 M M M M I llllllllllllll Illlllll IIMIIIIII 

Db 3 94 AGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGT 3 3 5 

Qy 978 GGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCT 1037 

MM II II llllllllllllllllllll Illlllll Illlllll Illlllll I 

Db 334 GGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTT 2 75 

Qy 103 8 GGAGCCTAGAC^CTCAGTACAGCAAGGTCCTGGCTCTGTACAACC^GC^a^CCCGGGCG 1097 

IIIMII Mill IIIIIIIMIIIIMMMI llllllllllllll IIIIIMIM 

Db 274 GGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCG 215 

Qy 1098 CGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACG 1157 

I 1 1 1 1 1 1 1 II 1 1 ! f 1 1 1 M 1 1 M II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 11 1 1 1 M M 1 1 1 1 1 1 1 

Db 214 CCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACG 155 

Qy 1158 TGGGCCG(^GCCCAAGGTGGAG(^GCTGTCC^ACATGATCGTGCGTTCCTGCAAGTGCA 1217 

1 1 1 1 1 Ml 1 1 1 1 1 1 i i M II 1 1 1 1 M, M I II I M 1 1 1 M 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 

Db 154 TGGGCCGCAAGCCO^GGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCA 95 

Qy 1218 GCTGAGGCCCCGCCCCGCCCACAGCCCCGCCCACCCGGCAGGCCCGGCCCCACCCCCGCC 1277 

lllllll I II 1 1 f 1 1 1 1 1 II I II lllllllllllllllllllllll M 

Db 94 GCTGAGGTCCCGCCCCGCC CCGCCCCGCCCCGGCAGGCCCGGCCCCACCCCGCCC 4 0 



Qy 1278 CGCCTCACCGG 12 8 8 

MM I I I 
Db 3 9 CGCCCCCGCTG 2 9 



Search completed: October 28, 2003, 00:08:20 
Job time : 3352.62 sees 



